Lecture 1 8.321 Quantum Theory I, Fall 2017 1 


Lecture 1 (Sep. 6, 2017) 


1.1 A “Weird” Example in Quantum Mechanics 


Suppose we have an atom with spin-1/2, and we measure the z-component of its spin, $S^z$. The first
claim of quantum mechanics that differs from classical mechanics is that we will always find one of
the two values $\pm\hbar/2$, where $\hbar = h/2\pi$.

The first thing we should wonder is, “How can we reasonably make this measurement?” There 
is a standard setup for such a measurement known as the Stern-Gerlach filter. We shoot a beam
of these atoms through a region of spatially-varying magnetic field $\mathbf{B} = B(z)\hat{z}$, such as the region
between a square north-pole magnet and a pointed south-pole magnet. When travelling through 
this region, spin-up and spin-down particles will deflect in opposite directions, allowing us to filter 
the particles by their spin. 

The reason this works is that the energy of the magnetic dipole moment of the atom in the
magnetic field is

$$E = -\boldsymbol{\mu}\cdot\mathbf{B} = -\mu_z B_z , \tag{1.1}$$

and the force on the atom in the z-direction is thus

$$F_z = -\frac{\partial E}{\partial z} = \mu_z \frac{dB_z}{dz} . \tag{1.2}$$


Thus, states with different values of $\mu_z$ will deflect in different directions along the z-axis.

Now, let’s imagine we have such an apparatus that can filter out the components of spin. We 
could modify this apparatus by changing its axis of orientation, and in this way we could create 
filters for spin along each of the x-, y-, and z-axes. Imagine then a sequence of filtering protocols
to which we can subject our beam of atoms. 

First, suppose we use this device to select out the $S^z = +\hbar/2$ atoms, and then measure $S^z$ once
again. This can be done by feeding the spin-up output of one z-oriented Stern-Gerlach filter into
the input of a second z-oriented Stern-Gerlach filter. With this setup, we will find that 100% of
the output of the second Stern-Gerlach filter will be spin-up.

Now consider altering this setup so that the second filter is oriented along the x-axis, so we are
sending the atoms through a z-axis filter and then an x-axis filter. The claim in quantum mechanics
is that we will now find that $S^x = +\hbar/2$ 50% of the time, and $S^x = -\hbar/2$ 50% of the time.

Now let’s complicate matters further by taking the $S^x = +\hbar/2$ output of the x-axis filter and
feeding it into yet another z-axis filter. The claim is that we will find $S^z = +\hbar/2$ 50% of the time,
and $S^z = -\hbar/2$ 50% of the time. We have filtered out atoms that are only spin-up along the z-axis,
then from among these we have filtered out only those that are spin-up along the x-axis, and then
measured the spin along the z-axis again, and found that the atoms are equally likely to be spin-up
and spin-down along the z-axis.

The reason for this phenomenon is that in quantum mechanics, $S^z$ and $S^x$ are incompatible
observables, and cannot be measured simultaneously with complete accuracy. As far as we know,
this behavior is inconsistent with a classical description of the state of a spin-1/2 particle.
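
The three filter sequences above can be checked numerically. The following is a minimal sketch, assuming units with $\hbar = 1$, the standard two-component representation of the spin-1/2 kets, and the probability rule $|\langle \text{outcome}|\text{state}\rangle|^2$ that is stated as a postulate in the next section. The names `up_z`, `up_x`, and `prob` are illustrative choices, not notation from the lecture.

```python
import numpy as np

# Assumption: S^z eigenkets are the standard basis of C^2 (hbar = 1).
up_z = np.array([1.0, 0.0], dtype=complex)
down_z = np.array([0.0, 1.0], dtype=complex)

# S^x eigenkets written in the S^z basis.
up_x = (up_z + down_z) / np.sqrt(2)
down_x = (up_z - down_z) / np.sqrt(2)

def prob(outcome, state):
    """Probability |<outcome|state>|^2 for normalized kets."""
    return abs(np.vdot(outcome, state)) ** 2   # np.vdot conjugates its first argument

# z filter followed by a z filter: the selected beam is in up_z.
p_z_then_z = prob(up_z, up_z)

# z filter followed by an x filter.
p_z_then_x_up = prob(up_x, up_z)
p_z_then_x_down = prob(down_x, up_z)

# z filter, x filter (keeping spin-up along x), then z filter again.
p_x_then_z_up = prob(up_z, up_x)
p_x_then_z_down = prob(down_z, up_x)
```

Each filter is modeled as leaving the selected beam in the corresponding eigenket, which is why the third experiment starts from `up_x` and retains no memory of the first z-axis filter.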


1.2 The Fundamental Postulates of Quantum Mechanics 


We will take the following as postulates of a quantum mechanical system: 


1. The state of a quantum mechanical system at a fixed time t is given by a mathematical object
known as a vector (or more precisely, a ray) $|\psi\rangle$, which is an element of a complex Hilbert
space $\mathcal{H}$.




2. Observers cannot measure the state directly, but rather can only measure observables, which 
are Hermitian operators that act in the Hilbert space H, and whose eigenvectors form a 
complete set. 


3. A measurement of an observable A returns one of its eigenvalues. 
a. If A is measured in a state $|\psi\rangle$, then the probability that A = a is given by

$$\langle\psi| M_a |\psi\rangle , \tag{1.3}$$

where

$$M_a = \sum_{j:\, a_j = a} |a_j\rangle\langle a_j| \tag{1.4}$$

is the measurement operator. Here, the sum is over all states $|a_j\rangle$ with eigenvalue a, i.e.,
$A|a_j\rangle = a|a_j\rangle$.
b. After measurement, the system is in the state

$$|\psi'\rangle \propto M_a|\psi\rangle . \tag{1.5}$$


This is an often-discussed feature of quantum mechanics, referred to as the “collapse” 
of the wavefunction. 


4. Time evolution is carried out by a map 


$$|\psi(t)\rangle \mapsto |\psi(t')\rangle = U(t',t)|\psi(t)\rangle , \tag{1.6}$$


where $U(t',t)$ is the time-evolution operator, which is a unitary operator, $U^\dagger U = 1$. The
infinitesimal form of the time-evolution operator, when t' and t are infinitesimally separated,
takes the form

$$i\hbar \frac{\partial}{\partial t} |\psi(t)\rangle = H|\psi(t)\rangle , \tag{1.7}$$


where H is a Hermitian operator called the Hamiltonian. 
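
The fourth postulate can be illustrated numerically. The sketch below assumes a time-independent Hamiltonian, for which $U(t,0) = e^{-iHt/\hbar}$ (this closed form is a standard result, not something derived in these notes so far), and a hypothetical two-level Hamiltonian $H = \tfrac{1}{2}\hbar\omega\,\mathrm{diag}(1,-1)$ with $\hbar = 1$. The function name `evolve` is an illustrative choice.

```python
import numpy as np

hbar = 1.0
omega = 2.0   # hypothetical frequency, chosen only for illustration
H = 0.5 * hbar * omega * np.array([[1, 0], [0, -1]], dtype=complex)

def evolve(H, t):
    """Return U(t, 0) = exp(-i H t / hbar) for a time-independent Hermitian H."""
    energies, vecs = np.linalg.eigh(H)
    phases = np.exp(-1j * energies * t / hbar)
    # Scale each eigenvector column by its phase, then rotate back.
    return (vecs * phases) @ vecs.conj().T

U = evolve(H, 0.7)
psi0 = np.array([1, 1], dtype=complex) / np.sqrt(2)
psi_t = U @ psi0
```

Unitarity of `U` guarantees that `psi_t` stays normalized, and differentiating `evolve(H, t)` with respect to `t` reproduces Eq. (1.7) for this choice of H.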


1.3. Mathematical Preliminaries 


1.3.1 Hilbert Spaces 


We will begin by recalling the definition of a vector space. A vector space V is a collection of 
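objects $\{|\alpha\rangle\}$ with several properties:


1. There is a well-defined addition operator ‘+’ defined on this collection, so that for any
$|\alpha\rangle, |\beta\rangle \in V$, there exists a unique state

$$|\alpha\rangle + |\beta\rangle = |\gamma\rangle \in V . \tag{1.8}$$

a. We demand that the addition operator is commutative: for all $|\alpha\rangle, |\beta\rangle \in V$,

$$|\alpha\rangle + |\beta\rangle = |\beta\rangle + |\alpha\rangle . \tag{1.9}$$

b. We demand that the addition operator is associative: for all $|\alpha\rangle, |\beta\rangle, |\gamma\rangle \in V$,

$$(|\alpha\rangle + |\beta\rangle) + |\gamma\rangle = |\alpha\rangle + (|\beta\rangle + |\gamma\rangle) . \tag{1.10}$$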




2. We require the existence of a null vector $|0\rangle \in V$ with the property that

$$|0\rangle + |\alpha\rangle = |\alpha\rangle \tag{1.11}$$

for all $|\alpha\rangle \in V$.


3. For all $|\alpha\rangle \in V$, we require the existence of an inverse state $-|\alpha\rangle \in V$ with the property

$$|\alpha\rangle + (-|\alpha\rangle) = |0\rangle . \tag{1.12}$$


4. There is a well-defined scalar multiplication operation: for some field F (such as $\mathbb{R}$ or $\mathbb{C}$), we
can multiply any state $|\alpha\rangle \in V$ by any scalar $c \in F$, yielding a unique state $c|\alpha\rangle \in V$.


a. For all $c, d \in F$ and $|\alpha\rangle \in V$, scalar multiplication must satisfy

$$c(d|\alpha\rangle) = (cd)|\alpha\rangle , \tag{1.13}$$

where cd indicates multiplication in the field F.


b. For all $|\alpha\rangle \in V$, the multiplicative identity element $1 \in F$ must satisfy

$$1|\alpha\rangle = |\alpha\rangle . \tag{1.14}$$


c. Scalar multiplication must distribute over addition in the vector space: for all $c \in F$ and
$|\alpha\rangle, |\beta\rangle \in V$, we must have

$$c(|\alpha\rangle + |\beta\rangle) = c|\alpha\rangle + c|\beta\rangle . \tag{1.15}$$


d. Scalar multiplication must distribute over addition in the field of scalars: for all $c, d \in F$
and $|\alpha\rangle \in V$, we must have

$$(c + d)|\alpha\rangle = c|\alpha\rangle + d|\alpha\rangle . \tag{1.16}$$


If the field of scalars is F, then we refer to V as a “vector space over F.” A vector space V is 
called a real vector space if its field of scalars is the field of real numbers, F = R, and a complex 
vector space if its field of scalars is the field of complex numbers, F = C. 

As a familiar example, we might consider the vectors in $\mathbb{R}^n$ of the form

$$(x_1, x_2, \ldots, x_n) . \tag{1.17}$$

These form a real vector space under addition. We could also consider the space of states of a
spin-1/2 particle,

$$c_+|+\rangle + c_-|-\rangle , \tag{1.18}$$

with $c_\pm \in \mathbb{C}$, which form a complex vector space. A more interesting example of a vector space is
the space of all real functions f(x) on the interval [0, 1].

A subset $V \subset W$ of a vector space W is a subspace of W if it itself is a vector space over the
field of scalars of W. A ray is a subspace of a vector space V of the form $\{c|\alpha\rangle \mid c \in F\}$, for some
$|\alpha\rangle \in V$.

A collection of vectors $|\alpha_1\rangle, \ldots, |\alpha_n\rangle \in V$ is linearly independent if and only if the statement

$$\sum_{j=1}^{n} c_j|\alpha_j\rangle = 0 \tag{1.19}$$




implies that $c_j = 0$ for all $j = 1, \ldots, n$. A collection of vectors $\{|\alpha_j\rangle\}$ is maximally linearly
independent if it is linearly independent and there exists no state $|\beta\rangle \in V$ such that the collection
$\{|\alpha_j\rangle, |\beta\rangle\}$ is linearly independent. In this case, $\{|\alpha_j\rangle\}$ is called a basis for V, and the number of
elements n in the collection is called the dimension of V, denoted dim V. The dimension of a vector
space can be finite or infinite, and if infinite, can be either countable (also called denumerable) or 
uncountable. 

Let’s consider some examples. A finite-dimensional vector space with which we are already
familiar is $\mathbb{R}^n$, which has $\dim \mathbb{R}^n = n$. An example of an infinite-dimensional vector space is the
space of square-integrable functions on the interval [0, 1] with f(0) = f(1) = 0. This vector space has
countable dimension: we can expand such a function f(x) as

$$f(x) = \sum_{k=1}^{\infty} c_k \sin(\pi k x) , \tag{1.20}$$

which provides a countable basis of functions for this space.
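
As a numerical illustration of Eq. (1.20), the sketch below expands the hypothetical test function $f(x) = x(1-x)$, which satisfies f(0) = f(1) = 0, in the first 25 sine modes. It assumes the standard coefficient formula $c_k = 2\int_0^1 dx\, f(x)\sin(\pi k x)$, which follows from the orthogonality of the sines on [0, 1]; the function name `sine_coefficients` and the grid size are illustrative choices.

```python
import numpy as np

def sine_coefficients(f, n_terms, n_grid=2001):
    """Approximate c_k = 2 * integral_0^1 f(x) sin(pi k x) dx for k = 1..n_terms."""
    x = np.linspace(0.0, 1.0, n_grid)
    dx = x[1] - x[0]
    k = np.arange(1, n_terms + 1)
    basis = np.sin(np.pi * np.outer(k, x))   # row k-1 holds sin(pi k x) on the grid
    # The integrand vanishes at both endpoints, so a plain sum equals the trapezoid rule.
    coeffs = 2.0 * dx * (basis * f(x)).sum(axis=1)
    return x, basis, coeffs

def f(x):
    return x * (1.0 - x)   # test function with f(0) = f(1) = 0

x, basis, coeffs = sine_coefficients(f, n_terms=25)
partial_sum = coeffs @ basis                      # truncated version of Eq. (1.20)
max_error = np.max(np.abs(partial_sum - f(x)))
```

For this particular f, the exact coefficients are $8/(\pi k)^3$ for odd k and zero for even k, so the truncated sum converges quickly.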

Similarly, we can consider the vector space of square-integrable functions on $(-\infty, \infty)$. More
surprisingly, this vector space also has countable dimension. A clean way to see this is to produce
a countable basis for this space: one such basis is the set of eigenfunctions of the simple harmonic
oscillator. These eigenfunctions are all square-integrable, and any other wavefunction can be
expanded in this basis. The oscillator eigenfunctions are labelled by integers, so this shows that this
space has countable dimension. This may seem confusing, because another basis we might use is the position
basis, where our basis elements are Dirac delta functions at each position; the problem with this
approach is that the Dirac delta function is not itself square-integrable.




Lecture 2 (Sep. 11, 2017) 


2.1 More Relevant Math 
2.1.1 Inner Products 


Last time, we discussed the concept of a maximally linearly independent set, which is a set $\{|\alpha_i\rangle\}$
of vectors that are linearly independent, and such that there exists no $|\beta\rangle$ such that $\{|\alpha_i\rangle, |\beta\rangle\}$ is
linearly independent. In this case, $\{|\alpha_i\rangle\}$ forms a basis for V. If $|\alpha_1\rangle, \ldots, |\alpha_n\rangle$ form a basis for V,
then any vector $|\beta\rangle$ can be decomposed in terms of the basis vectors as

$$|\beta\rangle = \sum_i c_i|\alpha_i\rangle . \tag{2.1}$$


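Next, we discuss the concept of an inner product (we will assume that V is a vector space over $\mathbb{C}$).
A (Hermitian) inner product is a map $(\cdot,\cdot): V \times V \to \mathbb{C}$ with the following properties:


1. For all $c \in \mathbb{C}$ and $|\alpha\rangle, |\beta\rangle \in V$, we have

$$(|\alpha\rangle, c|\beta\rangle) = c\,(|\alpha\rangle, |\beta\rangle) , \qquad (c|\alpha\rangle, |\beta\rangle) = c^*(|\alpha\rangle, |\beta\rangle) , \tag{2.2}$$

where $c^*$ is the complex conjugate of c.


2. For all $|\alpha\rangle, |\beta\rangle \in V$,

$$(|\alpha\rangle, |\beta\rangle)^* = (|\beta\rangle, |\alpha\rangle) . \tag{2.3}$$


3. For all $|\alpha\rangle, |\beta\rangle, |\beta'\rangle \in V$,

$$(|\alpha\rangle, |\beta\rangle + |\beta'\rangle) = (|\alpha\rangle, |\beta\rangle) + (|\alpha\rangle, |\beta'\rangle) . \tag{2.4}$$


4. For all $|\alpha\rangle \in V$,

$$(|\alpha\rangle, |\alpha\rangle) \geq 0 , \tag{2.5}$$

and if $(|\alpha\rangle, |\alpha\rangle) = 0$, then $|\alpha\rangle$ is the zero ket, $|\alpha\rangle = 0$.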


Let’s consider some examples. First, consider $V = \mathbb{C}^n$, whose vectors are of the form

$$|z\rangle = \begin{pmatrix} z_1 \\ \vdots \\ z_n \end{pmatrix} , \qquad z_1, \ldots, z_n \in \mathbb{C} . \tag{2.6}$$

One definition of an inner product on this space is

$$(|z\rangle, |w\rangle) = z_1^* w_1 + z_2^* w_2 + \cdots + z_n^* w_n , \tag{2.7}$$

which we can check satisfies all of the required properties of an inner product.
A more exciting example is the vector space of square-integrable functions on the interval [0, 1],

$$V = \left\{ f(x) \in \mathbb{C} \;\middle|\; \int_0^1 dx\, |f(x)|^2 < \infty \right\} , \tag{2.8}$$

as we discussed in the last lecture, which can be endowed with the inner product

$$(f, g) = \int_0^1 dx\, f^*(x)\, g(x) . \tag{2.9}$$


Now we introduce a bit of terminology. The vectors we have been discussing are referred to as 
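kets. Two kets are orthogonal if $(|\alpha\rangle, |\beta\rangle) = 0$. The norm of a ket is given by

$$N := \sqrt{(|\alpha\rangle, |\alpha\rangle)} := \big\| |\alpha\rangle \big\| . \tag{2.10}$$

Using this norm, we can normalize any nonzero ket by defining

$$|\tilde{\alpha}\rangle = \frac{1}{N}|\alpha\rangle , \tag{2.11}$$

which has

$$(|\tilde{\alpha}\rangle, |\tilde{\alpha}\rangle) = 1 . \tag{2.12}$$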
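
The inner product (2.7) on $\mathbb{C}^n$ and the normalization (2.11) are easy to try out numerically. The following is a small sketch using NumPy; the vectors `z`, `w` and the scalar `c` are arbitrary illustrative values, and the helper names `inner` and `normalize` are not notation from the lecture.

```python
import numpy as np

def inner(a, b):
    """Hermitian inner product (|a>, |b>) = sum_i a_i^* b_i on C^n, as in Eq. (2.7)."""
    return np.vdot(a, b)   # np.vdot conjugates its first argument

def normalize(a):
    """Return the ket divided by its norm N = sqrt((|a>, |a>)), as in Eq. (2.11)."""
    return a / np.sqrt(inner(a, a).real)

z = np.array([1 + 2j, 3 - 1j, 0.5j])
w = np.array([2 - 1j, 1j, 4.0])
c = 0.3 - 1.7j

linear_in_second = inner(z, c * w)        # should equal c * (z, w)
antilinear_in_first = inner(c * z, w)     # should equal c^* * (z, w)
z_tilde = normalize(z)
```

Note that the conjugation sits on the first argument, matching the convention of property 1 above (linear in the ket on the right, anti-linear in the ket on the left).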


2.1.2. Dual Space 


Now we introduce the dual space. The space V that we have described so far is the space of kets 
$|\alpha\rangle$. The dual space $V^*$ is the space of objects called bras, which is “dual” to the ket space. To
every $|\alpha\rangle \in V$, we associate a bra $\langle\alpha| \in V^*$, and define an operation between bras and kets,

$$\langle\alpha|\beta\rangle := (|\alpha\rangle, |\beta\rangle) . \tag{2.13}$$

This shows us the motivation for these names: when we combine a bra and a ket, we get a
“bra(c)ket.”
Consider the space of linear functionals $\varphi: V \to \mathbb{C}$. We introduce the notation $\varphi \to \langle\varphi|$, such
that for any basis $\{|\alpha_i\rangle\}$, the statement

$$\varphi: |\alpha_i\rangle \mapsto c_i \tag{2.14}$$

implies that

$$\langle\varphi|\alpha_i\rangle = c_i . \tag{2.15}$$

We see that the dual space is just the space of linear functionals on V. The $\{\varphi\}$ form an
n-dimensional vector space $V^*$, where n = dim V. We then have a duality $V \leftrightarrow V^*$, given by
$|\alpha\rangle \leftrightarrow \langle\alpha|$. Note that scalars are conjugated under this duality, so that for all $c \in \mathbb{C}$ and $|\alpha\rangle \in V$,
$c|\alpha\rangle \leftrightarrow c^*\langle\alpha|$.


2.1.3 Orthonormal Bases 
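An orthonormal basis is a basis $\{|\phi_i\rangle\}$ such that

$$\langle\phi_i|\phi_j\rangle = \delta_{ij} , \tag{2.16}$$

where $\delta_{ij}$ is the Kronecker delta. We know that we can write any ket in the form

$$|\alpha\rangle = \sum_i c_i|\phi_i\rangle . \tag{2.17}$$

If the $|\phi_i\rangle$ form an orthonormal basis, then taking the inner product of Eq. (2.17) with $\langle\phi_j|$ gives
$c_j = \langle\phi_j|\alpha\rangle$. Thus, if we have an orthonormal basis, then we can write any ket in the form

$$|\alpha\rangle = \sum_i |\phi_i\rangle\langle\phi_i|\alpha\rangle . \tag{2.18}$$

This is an example of a completeness relation.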
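
The completeness relation (2.18) can be verified numerically. The sketch below builds an orthonormal basis of $\mathbb{C}^3$ as the columns of a matrix `Q` obtained from a QR factorization of a random matrix; this construction, the seed, and the test ket `alpha` are assumptions made only for illustration.

```python
import numpy as np

# Columns of Q form an orthonormal basis {|phi_i>} of C^3.
rng = np.random.default_rng(0)
M = rng.normal(size=(3, 3)) + 1j * rng.normal(size=(3, 3))
Q, _ = np.linalg.qr(M)

alpha = np.array([1.0, 2.0 - 1.0j, 0.5j])

coeffs = Q.conj().T @ alpha        # c_i = <phi_i|alpha>
rebuilt = Q @ coeffs               # sum_i |phi_i> <phi_i|alpha>
identity = Q @ Q.conj().T          # sum_i |phi_i><phi_i|
```

Here `rebuilt` reproduces `alpha`, and `identity` is the 3 × 3 identity matrix up to rounding, which is the matrix form of the completeness relation.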




2.1.4 Operators 


Recall the second postulate of quantum mechanics: observables are represented in quantum me- 
chanics as Hermitian operators acting on the Hilbert space H (from here on out, we will consider 
states in the Hilbert space H rather than an arbitrary vector space V). In order to understand this 
statement, we first need to understand what an operator is, and then we must understand what a 
Hermitian operator is. 
An operator is an object that acts on a ket and returns another ket. Thus, if X is an operator,
then X acts on a ket $|\alpha\rangle \in \mathcal{H}$ as

$$X(|\alpha\rangle) := X|\alpha\rangle \in \mathcal{H} , \tag{2.19}$$

where the right-hand side is a naming convention. Two operators A and B are said to be equal
(A = B) if and only if $A|\alpha\rangle = B|\alpha\rangle$ for all $|\alpha\rangle \in \mathcal{H}$.

We introduce an addition operation ‘+’ on the space of operators, which is commutative and 
associative: for all operators X,Y, Z, we have 


$$X + Y = Y + X , \qquad X + (Y + Z) = (X + Y) + Z . \tag{2.20}$$


We will primarily be interested in linear operators: a linear operator X satisfies the property 
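$$X\left(c_\alpha|\alpha\rangle + c_\beta|\beta\rangle\right) = c_\alpha X|\alpha\rangle + c_\beta X|\beta\rangle \tag{2.21}$$

for all $c_\alpha, c_\beta \in F$ and $|\alpha\rangle, |\beta\rangle \in \mathcal{H}$. We can also define the notion of an anti-linear operator: an
anti-linear operator X satisfies

$$X\left(c_\alpha|\alpha\rangle + c_\beta|\beta\rangle\right) = c_\alpha^* X|\alpha\rangle + c_\beta^* X|\beta\rangle \tag{2.22}$$

for all $c_\alpha, c_\beta \in F$ and $|\alpha\rangle, |\beta\rangle \in \mathcal{H}$.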

We can also define an action of these same operators on the dual space, which will be a right 
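action of the form $\langle\beta|X$ for $\langle\beta| \in \mathcal{H}^*$. We define this object via its inner product with kets, by
requiring that it satisfy the property

$$(\langle\beta|X)|\alpha\rangle = \langle\beta|(X|\alpha\rangle) \tag{2.23}$$

for all $|\alpha\rangle \in \mathcal{H}$. Using an orthonormal basis $\{|\phi_j\rangle\}$, we can then express this bra as

$$\langle\beta|X = \sum_j \langle\beta|X|\phi_j\rangle\langle\phi_j| . \tag{2.24}$$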


We also define multiplication of operators, such that for all operators X, Y and $|\alpha\rangle \in \mathcal{H}$,

$$(XY)|\alpha\rangle = X(Y|\alpha\rangle) . \tag{2.25}$$

This multiplication is not necessarily commutative, $XY \neq YX$ in general, but it is associative,

$$X(YZ) = (XY)Z = XYZ . \tag{2.26}$$




2.1.5 Operators as Matrices in a Given Basis 


In a given basis, the action of an operator can be expressed by a matrix. To see this, we first define 
the identity operator $\mathbb{1}$, which satisfies

$$\mathbb{1}|\alpha\rangle = |\alpha\rangle \tag{2.27}$$

for all $|\alpha\rangle \in \mathcal{H}$. Given an orthonormal basis $\{|a'\rangle\}$ (notation from Sakurai), the identity operator
can be expressed in the form

$$\mathbb{1} = \sum_{a'} |a'\rangle\langle a'| . \tag{2.28}$$

We can use Eq. (2.28) to write any operator X in the form

$$X = \mathbb{1} X \mathbb{1} = \sum_{a', a''} |a'\rangle\langle a'|X|a''\rangle\langle a''| . \tag{2.29}$$

Note that $\langle a'|X|a''\rangle \in \mathbb{C}$ is simply a number. Thus, all information about X is equivalently
contained in the complex numbers $\langle a'|X|a''\rangle$, given any orthonormal basis $\{|a'\rangle\}$. This defines an
n × n matrix with complex entries corresponding to each operator X. For this reason, we will often
use the words “operator” and “matrix” interchangeably if the chosen basis is clear, even though 
the concept of an operator is more fundamental. 


2.1.6 Adjoint Operators 


For each operator X, we define its adjoint operator (also called the Hermitian conjugate) $X^\dagger$ to be
the operator satisfying

$$X|\alpha\rangle \leftrightarrow \langle\alpha|X^\dagger \tag{2.30}$$

under the duality $\mathcal{H} \leftrightarrow \mathcal{H}^*$. If $X^\dagger = X$, then the operator X is called self-adjoint or Hermitian.
(Note that in infinite-dimensional spaces, the concepts of self-adjointness and Hermiticity are
technically distinct. We may discuss this later in the course.) Note that $(XY)^\dagger = Y^\dagger X^\dagger$. The proof of
this fact is straightforward: For arbitrary $|\alpha\rangle \in \mathcal{H}$, consider the kets defined by

$$|\beta\rangle = Y|\alpha\rangle , \qquad |\gamma\rangle = X|\beta\rangle . \tag{2.31}$$

Their dual bras are then defined by

$$\langle\alpha|Y^\dagger = \langle\beta| , \qquad \langle\beta|X^\dagger = \langle\gamma| . \tag{2.32}$$

From these definitions, we have

$$XY|\alpha\rangle = |\gamma\rangle , \qquad \langle\alpha|Y^\dagger X^\dagger = \langle\gamma| . \tag{2.33}$$

Taking the dual of the first equation in (2.33) yields

$$\langle\alpha|(XY)^\dagger = \langle\gamma| , \tag{2.34}$$

which we can compare with the second equation in (2.33) to yield $(XY)^\dagger = Y^\dagger X^\dagger$.

This same concept has a well-defined meaning for matrices. As we have seen, any operator has
the same information content as an n × n matrix, where $n = \dim \mathcal{H}$. If M is a matrix corresponding
to the operator X, then $X^\dagger$ corresponds to the matrix found by conjugating the entries of the
transpose $M^T$. This matrix is denoted as $M^\dagger$, and is also called the Hermitian conjugate. A matrix
satisfying $M^\dagger = M$ is called a Hermitian matrix.
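
In matrix language, the adjoint is the conjugate transpose, and the identity $(XY)^\dagger = Y^\dagger X^\dagger$ can be spot-checked on random matrices. The sketch below is illustrative only: the random 3 × 3 matrices, the seed, and the helper name `dagger` are assumptions, not part of the lecture.

```python
import numpy as np

def dagger(M):
    """Hermitian conjugate: complex-conjugate the entries of the transpose."""
    return M.conj().T

rng = np.random.default_rng(1)
X = rng.normal(size=(3, 3)) + 1j * rng.normal(size=(3, 3))
Y = rng.normal(size=(3, 3)) + 1j * rng.normal(size=(3, 3))
alpha = rng.normal(size=3) + 1j * rng.normal(size=3)
beta = rng.normal(size=3) + 1j * rng.normal(size=3)

lhs = dagger(X @ Y)
rhs = dagger(Y) @ dagger(X)

# Duality check: <beta|X|alpha>^* equals <alpha|X^dagger|beta>.
bracket = np.vdot(beta, X @ alpha)
dual_bracket = np.vdot(alpha, dagger(X) @ beta)

# Any matrix of the form X + X^dagger is Hermitian.
hermitian = X + dagger(X)
```

The order reversal in `rhs` mirrors the proof above: the bra dual to $XY|\alpha\rangle$ is obtained by acting first with $Y^\dagger$ and then with $X^\dagger$ from the right.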




2.1.7 Operator Examples 


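As a first example of an operator, we define the outer product. Given a ket $|\alpha\rangle \in \mathcal{H}$ and a bra
$\langle\beta| \in \mathcal{H}^*$, we define the operator

$$O = |\alpha\rangle\langle\beta| \tag{2.35}$$

to be their outer product. This is an operator: for all $|\gamma\rangle \in \mathcal{H}$, we have

$$O|\gamma\rangle = |\alpha\rangle\langle\beta|\gamma\rangle . \tag{2.36}$$

Taking the dual of this equation, we find

$$\langle\gamma|O^\dagger = \langle\beta|\gamma\rangle^*\langle\alpha| = \langle\gamma|\beta\rangle\langle\alpha| , \tag{2.37}$$

and so we conclude that

$$O^\dagger = |\beta\rangle\langle\alpha| . \tag{2.38}$$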


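As a second example, consider the Hilbert space $\mathcal{H}$ of square-integrable functions on $(-\infty, \infty)$,
with inner product

$$\langle f_1|f_2\rangle = \int_{-\infty}^{\infty} dx\, f_1^*(x)\, f_2(x) \tag{2.39}$$

for $|f_1\rangle, |f_2\rangle \in \mathcal{H}$. Now consider the operator $A = \frac{d}{dx}$. Using integration by parts (the boundary
terms vanish for functions that decay at infinity), we see that

$$\int_{-\infty}^{\infty} dx\, f_1^*(x) \frac{d}{dx} f_2(x) = \int_{-\infty}^{\infty} dx \left(-\frac{d}{dx} f_1^*(x)\right) f_2(x) , \tag{2.40}$$

telling us that

$$\left\langle f_1 \middle| \frac{d}{dx} f_2 \right\rangle = -\left\langle \frac{d}{dx} f_1 \middle| f_2 \right\rangle . \tag{2.41}$$

Thus,

$$\left(\frac{d}{dx}\right)^\dagger = -\frac{d}{dx} . \tag{2.42}$$

Operators satisfying $A^\dagger = -A$ are called anti-Hermitian. We can make this into a Hermitian
operator by multiplying it by the imaginary unit (or its negative):

$$\left(-i\frac{d}{dx}\right)^\dagger = -i\frac{d}{dx} . \tag{2.43}$$

You may recognize this (up to a factor of $\hbar$) as the momentum operator in one-dimensional quantum mechanics.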
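
A discretized version of this example makes the anti-Hermiticity of $d/dx$ concrete. The sketch below replaces the derivative by a central-difference matrix on a finite grid with the function taken to vanish outside the grid; this discretization, the grid parameters, and the Gaussian test functions are assumptions made for illustration and are not part of the lecture.

```python
import numpy as np

def central_difference(n, h):
    """n x n central-difference approximation to d/dx with zero values outside the grid."""
    D = np.zeros((n, n))
    idx = np.arange(n - 1)
    D[idx, idx + 1] = 1.0 / (2.0 * h)
    D[idx + 1, idx] = -1.0 / (2.0 * h)
    return D

n, h = 200, 0.05
D = central_difference(n, h)
P = -1j * D                          # discrete analogue of -i d/dx

x = (np.arange(n) - n / 2) * h
f1 = np.exp(-x**2) * np.exp(2j * x)  # rapidly decaying test functions
f2 = x * np.exp(-x**2 / 2)

lhs = h * np.vdot(f1, D @ f2)        # approximates <f1 | d/dx f2>
rhs = -h * np.vdot(D @ f1, f2)       # approximates -<d/dx f1 | f2>
```

The matrix `D` is real and antisymmetric, so it is anti-Hermitian, and multiplying by $-i$ gives a Hermitian matrix `P`, in line with Eqs. (2.42) and (2.43).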
A third example is the identity operator 1, which we have already seen. 
For some operators, we can define an inverse operator: for an operator A, its inverse operator
$A^{-1}$ (if it exists) satisfies

$$A^{-1}A = AA^{-1} = \mathbb{1} . \tag{2.44}$$

Inverse operators are not guaranteed to exist.
Of special interest to us are unitary operators: a unitary operator U is one that satisfies
$U^{-1} = U^\dagger$. These are useful because they preserve the inner product:

$$\left(\langle\beta|U^\dagger\right)\left(U|\alpha\rangle\right) = \langle\beta|U^{-1}U|\alpha\rangle = \langle\beta|\alpha\rangle . \tag{2.45}$$

We are also interested in projection operators, which are operators A satisfying $A^2 = A$. An
example of a projection operator is $A = |\alpha\rangle\langle\alpha|$, for some normalized $|\alpha\rangle \in \mathcal{H}$.




2.1.8 Eigenstates and Eigenvalues 


If, for some operator A and ket $|a\rangle \in \mathcal{H}$, we have

$$A|a\rangle = a|a\rangle , \tag{2.46}$$

with $a \in \mathbb{C}$, then $|a\rangle$ is an eigenket or eigenstate of the operator A, with eigenvalue a. The spectrum
of an operator A is the set of its eigenvalues {a}.

We now prove an important theorem regarding the eigenkets and eigenvalues of Hermitian
operators.


Theorem 1 (Spectral Theorem). If A is a Hermitian operator, $A = A^\dagger$, then
1. all eigenvalues $a_i$ of A are real, and
2. eigenkets of A with distinct eigenvalues are orthogonal.
Proof. Suppose that for some Hermitian operator A and kets $|a'\rangle, |a''\rangle \in \mathcal{H}$ we have

$$A|a'\rangle = a'|a'\rangle , \tag{2.47a}$$
$$\langle a''|A = \langle a''|(a'')^* . \tag{2.47b}$$

Multiplying Eq. (2.47a) by $\langle a''|$ on the left and multiplying Eq. (2.47b) by $|a'\rangle$ on the right yields

$$\langle a''|A|a'\rangle = a'\langle a''|a'\rangle , \tag{2.48a}$$
$$\langle a''|A|a'\rangle = (a'')^*\langle a''|a'\rangle . \tag{2.48b}$$

Comparing Eqs. (2.48a) and (2.48b), we see that

$$\left[(a'')^* - a'\right]\langle a''|a'\rangle = 0 . \tag{2.49}$$

If $|a'\rangle = |a''\rangle$, then since $\langle a'|a'\rangle > 0$ this proves that $(a')^* = a'$, which means that $a' \in \mathbb{R}$. If
$a' \neq a''$, then, using the fact that the eigenvalues are real, this proves that $\langle a''|a'\rangle = 0$, i.e., $|a'\rangle$ and $|a''\rangle$ are orthogonal. This completes the
proof.


If we normalize each eigenket $|a_i\rangle$ of a Hermitian operator, so that $\langle a_i|a_i\rangle = 1$, and have chosen
the eigenkets so that distinct eigenkets with the same eigenvalue are orthogonal, then Thm. 1 tells
us that we have

$$\langle a_i|a_j\rangle = \delta_{ij} , \tag{2.50}$$

i.e., the normalized eigenkets are orthonormal. We can then decompose

$$A = \sum_a a\,|a\rangle\langle a| . \tag{2.51}$$

We can check that for some eigenket $|b\rangle$, we have

$$A|b\rangle = \sum_a a\,|a\rangle\langle a|b\rangle = \sum_a a\,|a\rangle\,\delta_{ab} = b|b\rangle . \tag{2.52}$$

Note that the equation

$$\mathbb{1} = \sum_a |a\rangle\langle a| \tag{2.53}$$

is simply a special case of Eq. (2.51).
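
Theorem 1 and the decomposition (2.51) can be checked on a concrete matrix. The sketch below uses a hypothetical 3 × 3 Hermitian matrix `A` chosen only for illustration, together with NumPy's `eigvals` (general eigenvalue solver) and `eigh` (solver for Hermitian matrices).

```python
import numpy as np

# Hypothetical Hermitian matrix: A equals its own conjugate transpose.
A = np.array([[2.0, 1.0 - 1.0j, 0.0],
              [1.0 + 1.0j, 3.0, 1.0j],
              [0.0, -1.0j, 1.0]])

# A general-purpose solver makes no reality assumption, so this checks statement 1.
general_eigvals = np.linalg.eigvals(A)

# eigh returns real eigenvalues and orthonormal eigenvectors (as columns).
eigvals, eigvecs = np.linalg.eigh(A)

# A = sum_a a |a><a|  and  1 = sum_a |a><a|.
rebuilt = sum(a * np.outer(v, v.conj()) for a, v in zip(eigvals, eigvecs.T))
resolution = sum(np.outer(v, v.conj()) for v in eigvecs.T)
```

Here `np.outer(v, v.conj())` is the matrix of the outer product $|a\rangle\langle a|$, so `rebuilt` and `resolution` are the matrix forms of Eqs. (2.51) and (2.53).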




Lecture 3 (Sep. 13, 2017) 


3.1 Even More Math 


3.1.1 More on Matrix Representations 


Last time, we described that in a given basis, there is an exact correspondence between n × n matrices
and operators, where n is the dimension of the Hilbert space. Let $\{|a\rangle\}$ form an orthonormal basis,
so that any state $|\alpha\rangle$ can be expanded as

$$|\alpha\rangle = \sum_a \langle a|\alpha\rangle\,|a\rangle = \sum_a c_a|a\rangle . \tag{3.1}$$

Similarly, any operator X can be expressed in the form

$$X = \sum_{a_i, a_j} |a_i\rangle\langle a_i|X|a_j\rangle\langle a_j| . \tag{3.2}$$

In this basis, all of the information of the operator X is contained in the matrix

$$X_{ij} = \langle a_i|X|a_j\rangle . \tag{3.3}$$

The terminology is that the $X_{ij}$ are matrix elements of X between states $|a_i\rangle$ and $|a_j\rangle$. As an
exercise, you can check that the matrix elements of the product operator XY are given by $\sum_j X_{ij}Y_{jk}$,
where $X_{ij}$ and $Y_{jk}$ are the matrix elements of the operators X and Y in some basis, respectively.
We define the trace of an operator A as

$$\mathrm{Tr}\,A = \sum_i \langle a_i|A|a_i\rangle = \sum_i A_{ii} . \tag{3.4}$$

If the $|a_i\rangle$ are chosen to be eigenvectors of A, then $A_{ii} = a_i$, so the trace becomes

$$\mathrm{Tr}\,A = \sum_i a_i . \tag{3.5}$$

This is a statement of the familiar fact that the trace of an operator is the sum of its eigenvalues.


3.1.2. Unitary Transformations 


Suppose we are given two orthonormal bases $\{|a_i\rangle\}$ and $\{|b_i\rangle\}$. How are these bases related? We
can define an operator U by the action

$$U|a_i\rangle = |b_i\rangle . \tag{3.6}$$

This implies that

$$\langle b_i| = \langle a_i|U^\dagger , \tag{3.7}$$

by definition of the adjoint.
Note that we can write

$$U = U\mathbb{1} = U\sum_i |a_i\rangle\langle a_i| = \sum_i |b_i\rangle\langle a_i| , \tag{3.8}$$




so the operator U is simply the sum of the outer products of corresponding vectors from each basis.
Then we have

$$U^\dagger = \sum_i |a_i\rangle\langle b_i| , \tag{3.9}$$

from which we find that

$$UU^\dagger = \sum_{i,j} |b_i\rangle\langle a_i|a_j\rangle\langle b_j| = \sum_i |b_i\rangle\langle b_i| = \mathbb{1} . \tag{3.10}$$

Thus, we see that U is unitary. Unitary transformations are precisely those that transform from
one orthonormal basis to another.
Consider now a vector $|\alpha\rangle$ and two distinct bases $\{|a_i\rangle\}$ and $\{|b_i\rangle\}$. We can express $|\alpha\rangle$ in two
ways as

$$|\alpha\rangle = \sum_i c_i|a_i\rangle = \sum_i d_i|b_i\rangle . \tag{3.11}$$

How are these two sets of coefficients $\{c_i\}$ and $\{d_i\}$ related? Using the definition of U, we can write

$$|\alpha\rangle = \sum_j d_j|b_j\rangle = \sum_j d_j\,U|a_j\rangle = \sum_{i,j} d_j\,|a_i\rangle\langle a_i|U|a_j\rangle . \tag{3.12}$$

Thus, we see that

$$c_i = \sum_j \langle a_i|U|a_j\rangle\,d_j = U_{ij}d_j , \tag{3.13}$$

where we have introduced the shorthand notation in which repeated indices are assumed to be
summed over (from now on, we will explicitly state the cases when we are not using this convention).
Note that the $U_{ij}$ are simply the matrix elements of the operator U.
A similar approach can be used to show that for any operator that can be expressed in two
different bases as

$$X = \sum_{i,j} |a_i\rangle X_{ij}\langle a_j| = \sum_{k,\ell} |b_k\rangle Y_{k\ell}\langle b_\ell| , \tag{3.14}$$

the matrix elements in the two different bases are related by

$$X_{ij} = U_{ik}\,Y_{k\ell}\,(U^\dagger)_{\ell j} . \tag{3.15}$$

We leave the proof as an exercise.
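
The relations (3.8), (3.13), and (3.15) can be tested numerically. In the sketch below, two orthonormal bases of $\mathbb{C}^3$ are stored as the columns of the matrices `a` and `b`; generating them from QR factorizations of random matrices is an assumption made for illustration, as are the variable names.

```python
import numpy as np

rng = np.random.default_rng(2)

def random_orthonormal_basis(n):
    """Columns of the returned matrix are orthonormal kets in C^n."""
    M = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    Q, _ = np.linalg.qr(M)
    return Q

a = random_orthonormal_basis(3)       # columns |a_i>
b = random_orthonormal_basis(3)       # columns |b_i>

U_op = b @ a.conj().T                 # U = sum_i |b_i><a_i|, Eq. (3.8)
U_mat = a.conj().T @ U_op @ a         # U_ij = <a_i|U|a_j>

alpha = rng.normal(size=3) + 1j * rng.normal(size=3)
c = a.conj().T @ alpha                # c_i = <a_i|alpha>
d = b.conj().T @ alpha                # d_i = <b_i|alpha>

X_op = rng.normal(size=(3, 3)) + 1j * rng.normal(size=(3, 3))
X_mat = a.conj().T @ X_op @ a         # X_ij = <a_i|X|a_j>
Y_mat = b.conj().T @ X_op @ b         # Y_kl = <b_k|X|b_l>
```

With these definitions, `c` equals `U_mat @ d` and `X_mat` equals `U_mat @ Y_mat @ U_mat.conj().T`, which are the matrix forms of Eqs. (3.13) and (3.15).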


3.1.3 Diagonalization of Hermitian Operators 


Theorem 2. A Hermitian matrix $H_{ij} = \langle\phi_i|H|\phi_j\rangle$ can always be diagonalized by a unitary
transformation.




Proof. Consider a general orthonormal basis $\{|\phi_i\rangle\}$, and let $\{|h_i\rangle\}$ be the orthonormal basis of
eigenstates of the operator H (such a basis exists because H is Hermitian). Because these are
both orthonormal bases, there exists a unitary transformation U such that $|h_i\rangle = U|\phi_i\rangle$. Using this
operator, we can then write

$$\delta_{ij}h_i = \langle h_i|H|h_j\rangle = \langle\phi_i|U^\dagger H U|\phi_j\rangle , \tag{3.16}$$

where the first expression on the left is not summed over i. Thus, we see that $(U^\dagger)_{ik}H_{k\ell}U_{\ell j}$ is a
diagonal matrix. This completes the proof.


3.1.4 Simultaneous Diagonalization 


Theorem 3. Two (diagonalizable) operators A, B are simultaneously diagonalizable if and only if
[A, B] = 0, where $[\cdot,\cdot]$ is the commutator.


Proof. Let there be a basis $\{|a_i\rangle\}$ for which the diagonalizable operators A, B have

$$A|a_i\rangle = a_i|a_i\rangle , \qquad B|a_i\rangle = b_i|a_i\rangle . \tag{3.17}$$

This is a basis in which A and B are simultaneously diagonal. In this case, we see that

$$AB|a_i\rangle = a_i b_i|a_i\rangle = BA|a_i\rangle , \tag{3.18}$$

and so AB = BA. This proves the forward direction.
Now consider two diagonalizable operators A, B that commute, AB = BA, and let $\{|a_i\rangle\}$ be a
basis of eigenvectors of A,

$$A|a_i\rangle = a_i|a_i\rangle . \tag{3.19}$$

Then we have

$$A(B|a_i\rangle) = AB|a_i\rangle = BA|a_i\rangle = Ba_i|a_i\rangle = a_i(B|a_i\rangle) , \tag{3.20}$$

so $B|a_i\rangle$ is an eigenket of A with eigenvalue $a_i$. In general, this means that B is block diagonal
in the basis $\{|a_i\rangle\}$ (once we have ordered the basis so as to group eigenkets of A with the same
eigenvalue), with each block corresponding to a single eigenvalue of A. We can then diagonalize
B within each block, which will leave A diagonal, thereby simultaneously diagonalizing A and B.
Visually, we have

$$A = \begin{pmatrix} a_1\mathbb{1}_{k_1} & & \\ & a_2\mathbb{1}_{k_2} & \\ & & \ddots \end{pmatrix} , \qquad B = \begin{pmatrix} B_1 & & \\ & B_2 & \\ & & \ddots \end{pmatrix} , \tag{3.21}$$

where the $B_i$ are $k_i \times k_i$ blocks, with $k_i$ the number of occurrences of the eigenvalue $a_i$ on the
diagonal of A. Diagonalizing the block $B_i$ only mixes eigenkets of A with the same eigenvalue, so
we can diagonalize B while leaving A diagonal. This proves the backward direction.
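
The backward direction of the proof is constructive and can be turned into a short procedure. The sketch below assumes that A and B are Hermitian (a special case of diagonalizable) and commute, and it groups eigenvalues of A that agree to within a tolerance `tol`; the function name, the tolerance, and the example matrices are illustrative assumptions.

```python
import numpy as np

def simultaneous_eigenbasis(A, B, tol=1e-9):
    """Return a matrix whose columns diagonalize both commuting Hermitian A and B."""
    a_vals, V = np.linalg.eigh(A)            # eigenvalues of A in ascending order
    W = np.zeros_like(V, dtype=complex)
    n = len(a_vals)
    start = 0
    while start < n:
        # Find the range of columns sharing one eigenvalue of A.
        stop = start + 1
        while stop < n and abs(a_vals[stop] - a_vals[start]) < tol:
            stop += 1
        block = V[:, start:stop]                 # eigenspace of A
        B_block = block.conj().T @ B @ block     # the block B_i of Eq. (3.21)
        _, S = np.linalg.eigh(B_block)           # diagonalize B inside the block
        W[:, start:stop] = block @ S
        start = stop
    return W

# Hypothetical example: A has a doubly degenerate eigenvalue, and B mixes that eigenspace.
A = np.diag([1.0, 1.0, 2.0])
B = np.array([[0.0, 1.0, 0.0],
              [1.0, 0.0, 0.0],
              [0.0, 0.0, 5.0]])

W = simultaneous_eigenbasis(A, B)
A_diag = W.conj().T @ A @ W
B_diag = W.conj().T @ B @ W
```

Rotating within a degenerate eigenspace of A leaves A proportional to the identity there, which is why only B changes inside each block.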




3.2 Measurement 


Consider a quantum mechanical system. We know that the state of such a system consists of a 
normalized state $|\psi\rangle \in \mathcal{H}$, and observables are represented by Hermitian operators acting in the
Hilbert space $\mathcal{H}$. Recall the third postulate, regarding measurement:


a. The possible results of measuring A are the eigenvalues $a_i$ of A.


b. Once a measurement is done, and the result is $a_i$ for some i, the system “collapses” to an
eigenket $|a_i\rangle$ with a sharp value of A: if we perform a measurement again instantaneously,
we are guaranteed to get the same value as the first measurement.


c. The probability that a measurement gives the result $A = a_i$ is given by

$$\mathrm{Prob}(A = a_i) := \sum_{j:\, a_j = a_i} |\langle a_j|\psi\rangle|^2 = \sum_{j:\, a_j = a_i} \langle\psi|a_j\rangle\langle a_j|\psi\rangle = \langle\psi|M_{a_i}|\psi\rangle , \tag{3.22}$$

where we have defined the measurement operator

$$M_{a_i} := \sum_{j:\, a_j = a_i} |a_j\rangle\langle a_j| . \tag{3.23}$$

This operator is the projector onto the subspace with $A = a_i$.


If we simplify to the case where there is only one eigenket with eigenvalue $a_i$, then

$$\mathrm{Prob}(A = a_i) = |\langle a_i|\psi\rangle|^2 . \tag{3.24}$$


Historically, there has been a lot of worry about the collapse of the wavefunction. In the modern 
language, there is a way to understand this collapse in a very palatable way, which we may discuss 
later in the course. 

The concept of measurement is worth pondering. Consider making a measurement on a state, 
and then immediately making another measurement on it. Can we actually do this? Say our 
measurement is to see if there is a photon hitting our detector. Prior to measurement, the state 
consists of a detector and a photon. After measurement, there is a click, and the photon is gone. 
How can we remeasure? We must separately consider measurements that destroy our state and ones 
that do not. Measurements that do not destroy the state are called “non-demolition measurements.” 


3.2.1 Comments 


1. We have said that

$$\mathrm{Prob}(A = a_i) = \sum_{j:\, a_j = a_i} |\langle a_j|\psi\rangle|^2 . \tag{3.25}$$

This can only make sense if the value on the right-hand side is non-negative, and the sum of
these probabilities over all eigenvalues is 1. It is clear that the right-hand side is non-negative,




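as it is a sum of non-negative numbers, and we see that

$$\sum_i \mathrm{Prob}(A = a_i) = \sum_j |\langle a_j|\psi\rangle|^2 = \sum_j \langle\psi|a_j\rangle\langle a_j|\psi\rangle = \langle\psi|\psi\rangle = 1 , \tag{3.26}$$

because the state $|\psi\rangle$ is assumed to be normalized.


2. For any observable A and state $|\psi\rangle$, the expectation value of A is

$$\langle A\rangle = \sum_i a_i\,\mathrm{Prob}(A = a_i) = \sum_{a_i} a_i \sum_{j:\, a_j = a_i} \langle\psi|a_j\rangle\langle a_j|\psi\rangle = \langle\psi|A|\psi\rangle , \tag{3.27}$$

where in the final line we have used

$$A = \sum_i a_i|a_i\rangle\langle a_i| . \tag{3.28}$$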
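
The measurement rules (3.22) to (3.28) can be tried out on a small example with a degenerate eigenvalue. The sketch below uses a hypothetical three-level observable with eigenvalues 1, 1, 2 whose eigenkets are the standard basis vectors; the observable, the state `psi`, and all function names are assumptions chosen for illustration.

```python
import numpy as np

# Hypothetical observable: eigenvalue 1 is doubly degenerate, eigenvalue 2 is not.
eigenvalues = np.array([1.0, 1.0, 2.0])
eigenkets = np.eye(3, dtype=complex)          # columns are the eigenkets |a_j>
A = (eigenkets * eigenvalues) @ eigenkets.conj().T

def measurement_operator(a, tol=1e-12):
    """Projector M_a = sum over eigenkets with eigenvalue a of |a_j><a_j|."""
    cols = eigenkets[:, np.abs(eigenvalues - a) < tol]
    return cols @ cols.conj().T

def probability(a, psi):
    """Prob(A = a) = <psi|M_a|psi> for a normalized state psi."""
    return np.vdot(psi, measurement_operator(a) @ psi).real

def collapse(a, psi):
    """Normalized post-measurement state, proportional to M_a|psi>."""
    post = measurement_operator(a) @ psi
    return post / np.linalg.norm(post)

psi = np.array([1.0, 1.0j, 1.0]) / np.sqrt(3)
probs = {a: probability(a, psi) for a in (1.0, 2.0)}
expectation = sum(a * p for a, p in probs.items())
```

Measuring again immediately after `collapse(1.0, psi)` returns the value 1 with probability one, which is the repeatability property described in item b of the postulate.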


3.3 Spin-1/2 Systems


Consider a spin-1/2 system. The state space is spanned by the eigenstates of, for example, $S^z$. We
denote the state with $S^z = +\hbar/2$ by $|+\rangle$ and the state with $S^z = -\hbar/2$ by $|-\rangle$. As a set, the Hilbert
space is then

$$\mathcal{H} = \left\{ |\psi\rangle = c_+|+\rangle + c_-|-\rangle \;\middle|\; c_\pm \in \mathbb{C},\ |c_+|^2 + |c_-|^2 = 1 \right\} . \tag{3.29}$$

This is a subset of the two-dimensional complex vector space $\mathbb{C}^2$.

The states $|\psi\rangle$ and $e^{i\lambda}|\psi\rangle$, for some $\lambda \in \mathbb{R}$, are physically equivalent by definition (the Hilbert
space is actually the space above quotiented by this equivalence relation), so the only physically
relevant phase information in a state is the relative phase of the coefficients $(c_+, c_-)$. We can then
parameterize these coefficients in the form

$$c_+ = \cos\frac{\theta}{2} , \qquad c_- = e^{i\phi}\sin\frac{\theta}{2} , \tag{3.30}$$

with $0 \leq \theta \leq \pi$ and $0 \leq \phi < 2\pi$. Specifying these two angles specifies the state exactly, and we note
that specifying these angles is equivalent to specifying a point on the surface of the unit sphere $S^2$.
This parametrization of the Hilbert space is known as the Bloch sphere. For example, the north
pole of the Bloch sphere has $c_+ = 1, c_- = 0$, and so represents the state $|\psi\rangle = |+\rangle$, while the south
pole has $c_+ = 0, c_- = 1$, and so represents the state $|\psi\rangle = |-\rangle$.
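
The parametrization (3.30) and its inverse can be written down directly. The sketch below is an illustration under the assumption that states are stored as two-component arrays $(c_+, c_-)$; the function names `bloch_state` and `bloch_angles` are hypothetical. The inverse map strips the overall phase, which reflects the statement that only the relative phase is physically relevant.

```python
import numpy as np

def bloch_state(theta, phi):
    """Coefficients (c_+, c_-) of Eq. (3.30) for Bloch angles theta and phi."""
    return np.array([np.cos(theta / 2), np.exp(1j * phi) * np.sin(theta / 2)])

def bloch_angles(state):
    """Recover (theta, phi) from any nonzero state, ignoring norm and overall phase."""
    c_plus, c_minus = state / np.linalg.norm(state)
    theta = 2 * np.arctan2(abs(c_minus), abs(c_plus))
    # Only the relative phase between c_- and c_+ enters phi.
    phi = (np.angle(c_minus) - np.angle(c_plus)) % (2 * np.pi)
    return theta, phi

north = bloch_state(0.0, 0.0)            # the state |+>
south = bloch_state(np.pi, 0.0)          # the state |->

# Multiplying by an overall phase must not change the recovered angles.
psi = np.exp(0.3j) * bloch_state(1.1, 4.0)
theta, phi = bloch_angles(psi)
```

At the poles the angle $\phi$ is not determined by the state, so `bloch_angles` should only be trusted for $\phi$ when $0 < \theta < \pi$.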
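We will now define several important operators in this space. The identity operator is

$$\mathbb{1} = |+\rangle\langle+| + |-\rangle\langle-| , \tag{3.31}$$

which can be represented by a matrix

$$\mathbb{1} = \begin{pmatrix} 1 & 0 \\ 0 & 1 \end{pmatrix} . \tag{3.32}$$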




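Keep in mind that using an equals sign here is an abuse of notation, as the matrix representation
of an operator is formally distinct from the operator itself. The operator $S^z$ is given by

$$S^z = \frac{\hbar}{2}\left(|+\rangle\langle+| - |-\rangle\langle-|\right) = \frac{\hbar}{2}\sigma^z = \frac{\hbar}{2}\begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix} , \tag{3.33}$$

where $\sigma^z$ is the third Pauli matrix. Similarly, we can write

$$S^x = \frac{\hbar}{2}\left(|+\rangle\langle-| + |-\rangle\langle+|\right) = \frac{\hbar}{2}\sigma^x = \frac{\hbar}{2}\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} ,$$
$$S^y = \frac{i\hbar}{2}\left(-|+\rangle\langle-| + |-\rangle\langle+|\right) = \frac{\hbar}{2}\sigma^y = \frac{\hbar}{2}\begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix} . \tag{3.34}$$

We can check explicitly (left as an exercise) that these operators satisfy

$$[S^a, S^b] = i\hbar\,\epsilon^{abc}S^c , \qquad \{S^a, S^b\} = \frac{\hbar^2}{2}\delta^{ab} , \qquad \mathbf{S}\cdot\mathbf{S} = S^aS^a = \frac{3\hbar^2}{4} , \tag{3.35}$$

where $[\cdot,\cdot]$ is the commutator, $\{\cdot,\cdot\}$ is the anticommutator, and $\epsilon^{abc}$ is the totally antisymmetric
three-index tensor (known as the Levi-Civita symbol).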
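
The exercise of checking the commutation and anticommutation relations can also be done numerically. The sketch below assumes units with $\hbar = 1$ and the standard matrix representation of the Pauli matrices in the $S^z$ basis; the array `eps` is an explicit construction of the Levi-Civita symbol, and the helper names are illustrative.

```python
import numpy as np

hbar = 1.0   # assumption: units with hbar = 1

sigma = np.array([[[0, 1], [1, 0]],        # sigma^x
                  [[0, -1j], [1j, 0]],     # sigma^y
                  [[1, 0], [0, -1]]],      # sigma^z
                 dtype=complex)
S = 0.5 * hbar * sigma                     # S[0], S[1], S[2] = S^x, S^y, S^z

# Levi-Civita symbol: +1 on cyclic permutations of (0, 1, 2), -1 on the odd ones.
eps = np.zeros((3, 3, 3))
for a, b, c in [(0, 1, 2), (1, 2, 0), (2, 0, 1)]:
    eps[a, b, c] = 1.0
    eps[b, a, c] = -1.0

def commutator(X, Y):
    return X @ Y - Y @ X

def anticommutator(X, Y):
    return X @ Y + Y @ X

# Total spin squared, S^x S^x + S^y S^y + S^z S^z.
S_squared = sum(S[a] @ S[a] for a in range(3))
```

Looping over all index pairs (a, b) and comparing against `eps` and the Kronecker delta confirms each component of the relations above for this representation.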


Lecture 4 (Sep. 18, 2017) 
4.1 Measurement 
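4.1.1 Spin-1/2 Systems
Last time, we said that a general state in a spin-1/2 system can be written as

$$|\psi\rangle = c_+|+\rangle + c_-|-\rangle , \tag{4.1}$$

where $|+\rangle, |-\rangle \in \mathcal{H}$ are eigenstates of $S^z$ and $c_+, c_- \in \mathbb{C}$. We noted that only the relative phase
between $c_+$ and $c_-$ was physically relevant.

Assume that $|\psi\rangle$ is normalized, so that $\langle\psi|\psi\rangle = 1$, which implies that

$$|c_+|^2 + |c_-|^2 = 1 . \tag{4.2}$$

Now consider what happens when we measure $S^z$. We find

$$\mathrm{Prob}\left(S^z = +\frac{\hbar}{2}\right) = |\langle+|\psi\rangle|^2 = |c_+|^2 , \qquad \mathrm{Prob}\left(S^z = -\frac{\hbar}{2}\right) = |\langle-|\psi\rangle|^2 = |c_-|^2 . \tag{4.3}$$

We now want to consider a measurement of spin along an arbitrary axis $\hat{n}$. In general, we have

$$\mathrm{Prob}\left(\mathbf{S}\cdot\hat{n} = \pm\frac{\hbar}{2}\right) = \left|\left\langle \mathbf{S}\cdot\hat{n} = \pm\tfrac{\hbar}{2} \middle| \psi \right\rangle\right|^2 , \tag{4.4}$$

where

$$\left|\mathbf{S}\cdot\hat{n} = \pm\tfrac{\hbar}{2}\right\rangle \tag{4.5}$$

is the eigenket of the operator $\mathbf{S}\cdot\hat{n}$ with the eigenvalue $\pm\hbar/2$. This uses the fact that the eigenvalues
of the spin projection along any axis are always $\pm\hbar/2$. We can deduce this by noting that we could
have chosen any axis originally as the quantization axis, but you should also try to prove it by
expressing a general $\mathbf{S}\cdot\hat{n}$ as a matrix and computing its eigenvalues.

For example, the eigenkets of $S^x$ are

$$\left|S^x = \pm\tfrac{\hbar}{2}\right\rangle = \frac{1}{\sqrt{2}}\left(|+\rangle \pm |-\rangle\right) \tag{4.6}$$

with eigenvalues $\pm\hbar/2$. We can prove this using the fact that $S^x$ corresponds to the matrix

$$S^x = \frac{\hbar}{2}\begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} . \tag{4.7}$$

We can then compute

$$\mathrm{Prob}\left(S^x = \pm\frac{\hbar}{2}\right) = \left|\left\langle S^x = \pm\tfrac{\hbar}{2} \middle| \psi \right\rangle\right|^2 = \left|\frac{1}{\sqrt{2}}\left(\langle+| \pm \langle-|\right)\left(c_+|+\rangle + c_-|-\rangle\right)\right|^2 = \left|\frac{1}{\sqrt{2}}\left(c_+ \pm c_-\right)\right|^2 . \tag{4.8}$$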
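
The suggested exercise of writing $\mathbf{S}\cdot\hat{n}$ as a matrix and computing its eigenvalues can be sketched numerically. The code below assumes units with $\hbar = 1$ and the standard Pauli-matrix representation in the $S^z$ basis; the function names and the particular axis and state used are illustrative assumptions.

```python
import numpy as np

hbar = 1.0   # assumption: units with hbar = 1
sigma_x = np.array([[0, 1], [1, 0]], dtype=complex)
sigma_y = np.array([[0, -1j], [1j, 0]], dtype=complex)
sigma_z = np.array([[1, 0], [0, -1]], dtype=complex)

def spin_along(n):
    """Matrix of S.n for a (not necessarily normalized) real direction n."""
    n = np.asarray(n, dtype=float)
    n = n / np.linalg.norm(n)
    return 0.5 * hbar * (n[0] * sigma_x + n[1] * sigma_y + n[2] * sigma_z)

def outcome_probabilities(n, psi):
    """Eigenvalues of S.n (ascending) and the probabilities |<eigenket|psi>|^2."""
    vals, vecs = np.linalg.eigh(spin_along(n))
    probs = np.abs(vecs.conj().T @ psi) ** 2
    return vals, probs

# Eigenvalues along an arbitrary axis.
vals_generic, _ = outcome_probabilities([1.0, 2.0, -2.0], np.array([1.0, 0.0]))

# Compare with Eq. (4.8) for a state with c_+ = 0.6, c_- = 0.8, measured along x.
psi = np.array([0.6, 0.8], dtype=complex)
vals_x, probs_x = outcome_probabilities([1.0, 0.0, 0.0], psi)
```

Because `eigh` sorts eigenvalues in ascending order, `probs_x[0]` belongs to $S^x = -\hbar/2$ and `probs_x[1]` to $S^x = +\hbar/2$.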




4.1.2 The Stern—Gerloch Filter Revisited 


Recall the experiment discussed in the first lecture, in which we feed a beam of spin-5 atoms through 
a region of spatially-varying magnetic field, resulting in the beam splitting into two beams, one of 
which has $* = +4 and the other of which has S* = —. We discussed several thought experiments 
using such filters oriented along various axes: 


1. First, we discussed feeding the beam through a z-axis filter, and then taking the $S^z = +\frac{\hbar}{2}$
output and feeding it through yet another z-axis filter. The result is that all of the atoms
come out of the filter with $S^z = +\frac{\hbar}{2}$. This corresponds to the collapse of the wavefunction;
the wavefunction has collapsed after the first measurement, and so the second measurement
returns the same result without further altering the wavefunction.


2. Next, we considered feeding the beam through a z-axis filter, and then taking the $S^z = +\frac{\hbar}{2}$
output and feeding it through an x-axis filter. The result is that 50% of the atoms come out
with each of $S^x = \pm\frac{\hbar}{2}$. We can see this from the calculation in Eq. (4.8), with $c_+ = 1$, $c_- = 0$.


3. Finally, we considered feeding the beam through a z-axis filter, feeding the $S^z = +\frac{\hbar}{2}$ output
through an x-axis filter, and then feeding the $S^x = +\frac{\hbar}{2}$ output through another z-axis filter.
The result is again that 50% of the atoms come out with each of $S^z = \pm\frac{\hbar}{2}$. In this context,
we reached the conclusion that we cannot simultaneously measure $S^z$ and $S^x$. These are
incompatible observables.
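
The three filter experiments can be checked numerically with the Born rule. The following is a minimal sketch, assuming kets are stored as pairs of amplitudes in the $S^z$ basis; the names `prob`, `UP_Z`, `UP_X`, and so on are illustrative and do not come from the lecture.

```python
import math

# Kets are (c_plus, c_minus) pairs in the S^z basis. All names are illustrative.
R = 1 / math.sqrt(2)
UP_Z, DOWN_Z = (1.0, 0.0), (0.0, 1.0)
UP_X, DOWN_X = (R, R), (R, -R)


def prob(outcome, state):
    """Born rule: |(outcome, state)|^2 for two-component kets."""
    amp = sum(complex(o).conjugate() * s for o, s in zip(outcome, state))
    return abs(amp) ** 2


# Experiment 1: keep S^z = +hbar/2, then measure S^z again.
p_z_then_z = prob(UP_Z, UP_Z)

# Experiment 2: keep S^z = +hbar/2, then measure S^x.
p_z_then_x = (prob(UP_X, UP_Z), prob(DOWN_X, UP_Z))

# Experiment 3: keep S^z = +hbar/2, keep S^x = +hbar/2, then measure S^z.
# After the x filter the state has collapsed to UP_X.
p_z_x_then_z = (prob(UP_Z, UP_X), prob(DOWN_Z, UP_X))

print(p_z_then_z, p_z_then_x, p_z_x_then_z)
```

The first probability is 1 and the remaining four are each 1/2 up to floating-point rounding, matching the three results listed above.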


4.2 Compatible and Incompatible Observables 


We have seen that $S^z$ and $S^x$ cannot be simultaneously sharp (just like x and p). When can two
observables be simultaneously sharp? Let A, B be two Hermitian operators corresponding to two
observables. If we measure A, then the wavefunction collapses to one of the eigenstates $|a\rangle$ of A
with eigenvalue a. We next want to measure B and see what happens. Suppose that $|a\rangle$ is also an
eigenstate of B with eigenvalue b; in this case, measuring B will return the corresponding eigenvalue
b. If we remeasure A, then we will once again find the result a. Any measurements of these two
operators will always return sharp values, without changing the state after the first measurement.

Thus, the operators A, B can be simultaneously measured to have sharp values if and only if 
all eigenstates of A are also eigenstates of B. Two such observables are called compatible, and 
two observables not satisfying this relation are called incompatible. We can make this statement 
more concise. If the eigenstates of A are exactly those of B, then A and B are simultaneously 
diagonalizable, which occurs if and only if [A, B] = 0 (we proved this in the last lecture). This is the 
concise condition for compatibility: two observables are compatible if and only if the corresponding 
operators commute. 

As an example, for the spin-1/2 system, the operators $S^2 = S_x^2 + S_y^2 + S_z^2$ and $S^z$ (or the spin
along any given axis) are compatible operators, while $S^z$ and $S^x$ are incompatible. Note that
$$ [S^z, S^x] = i\hbar S^y \neq 0. \tag{4.9} $$
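
These commutators can be verified directly with 2x2 matrices. The sketch below assumes units with $\hbar = 1$ and the standard Pauli-matrix representation; only the $S^x$ matrix appears in these notes, so the $S^y$ and $S^z$ matrices and all helper names are assumptions made for illustration.

```python
HBAR = 1.0  # assumption: units in which hbar = 1


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def combine(a, b, sign=1):
    """Entrywise a + sign * b."""
    return [[a[i][j] + sign * b[i][j] for j in range(2)] for i in range(2)]


def scale(c, a):
    return [[c * a[i][j] for j in range(2)] for i in range(2)]


def commutator(a, b):
    return combine(matmul(a, b), matmul(b, a), sign=-1)


def close(a, b, tol=1e-12):
    return all(tol > abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))


# Standard spin-1/2 matrices in the S^z basis (assumed representation).
SX = scale(HBAR / 2, [[0, 1], [1, 0]])
SY = scale(HBAR / 2, [[0, -1j], [1j, 0]])
SZ = scale(HBAR / 2, [[1, 0], [0, -1]])
S_SQUARED = combine(combine(matmul(SX, SX), matmul(SY, SY)), matmul(SZ, SZ))
ZERO = [[0, 0], [0, 0]]

sz_sx_matches = close(commutator(SZ, SX), scale(1j * HBAR, SY))  # [S^z, S^x] = i hbar S^y
s2_commutes_with_sz = close(commutator(S_SQUARED, SZ), ZERO)
s2_commutes_with_sx = close(commutator(S_SQUARED, SX), ZERO)
sz_sx_commute = close(commutator(SZ, SX), ZERO)

print(sz_sx_matches, s2_commutes_with_sz, s2_commutes_with_sx, sz_sx_commute)
```

In this representation $S^2$ is proportional to the identity, which is why it commutes with the spin along every axis.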


Given a Hilbert space, we can ask what is the maximum number of mutually compatible observables
we can find. A complete set of commuting observables is a set of observables {A, B, C, ...}
such that all pairwise commutators vanish,
$$ [A, B] = [A, C] = [B, C] = \cdots = 0, \tag{4.10} $$
and such that for any a, b, c, ..., there is at most one solution (up to normalization and phase) to the
eigenvalue equation
$$ A|\psi\rangle = a|\psi\rangle, \qquad B|\psi\rangle = b|\psi\rangle, \qquad C|\psi\rangle = c|\psi\rangle, \qquad \ldots \tag{4.11} $$


4.3 The Generalized Uncertainty Relation


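Consider two observables A and B. If we measure A, we want to discuss the variance of the
probability distribution of possible outcomes. We do this in the standard way: we define the
variance of A as
$$ \Delta A^2 = \langle (A - \langle A\rangle)^2\rangle = \langle A^2\rangle - \langle A\rangle^2 = \langle\psi|A^2|\psi\rangle - \left(\langle\psi|A|\psi\rangle\right)^2. \tag{4.12} $$
If $|\psi\rangle = |a\rangle$ is an eigenket of A with eigenvalue a, then
$$ \langle\psi|A^2|\psi\rangle = a^2, \qquad \langle\psi|A|\psi\rangle = a, \tag{4.13} $$
so $\Delta A^2 = 0$. In general, however, the variance will not vanish.
We further define the standard deviation $\Delta A = \sqrt{\Delta A^2}$. The uncertainty relation states that
$$ \Delta A\,\Delta B \geq \frac{1}{2}\left|\langle [A, B]\rangle\right|. \tag{4.14} $$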


(Note that this is not the strongest possible form of the uncertainty relation, but it is the most 
commonly seen form.) 


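Proof. We will make use of the Cauchy-Schwarz inequality: for all $|a\rangle, |b\rangle \in \mathcal{H}$,
$$ |\langle a|b\rangle|^2 \leq \langle a|a\rangle\langle b|b\rangle. \tag{4.15} $$
For a general $|\psi\rangle$, define the operators
$$ \delta A = A - \langle A\rangle, \qquad \delta B = B - \langle B\rangle, \tag{4.16} $$
and the states
$$ |a\rangle = \delta A|\psi\rangle, \qquad |b\rangle = \delta B|\psi\rangle. \tag{4.17} $$
Applying the Cauchy-Schwarz inequality, we then have
$$ |\langle\psi|\delta A\,\delta B|\psi\rangle|^2 \leq \langle\psi|(\delta A)^2|\psi\rangle\langle\psi|(\delta B)^2|\psi\rangle. \tag{4.18} $$
Note that $\delta A$ and $\delta B$ are Hermitian, but there is no guarantee that their product is Hermitian.
However, we know that we can write
$$ \delta A\,\delta B = \frac{\delta A\,\delta B + \delta B\,\delta A}{2} + \frac{\delta A\,\delta B - \delta B\,\delta A}{2}. \tag{4.19} $$
The first term on the right-hand side is Hermitian, while the second is anti-Hermitian. We can
write this more concisely as
$$ \delta A\,\delta B = \frac{1}{2}\{\delta A, \delta B\} + \frac{1}{2}[\delta A, \delta B]. \tag{4.20} $$
Thus,
$$ \langle\delta A\,\delta B\rangle = \frac{1}{2}\langle\{\delta A, \delta B\}\rangle + \frac{1}{2}\langle[\delta A, \delta B]\rangle. \tag{4.21} $$
The first term on the right-hand side must be real, as the expectation value of any Hermitian operator is
real, and the second term on the right-hand side must be imaginary, because any anti-Hermitian
operator can be written as i times a Hermitian operator. Thus, the squared modulus of this quantity
will simply be the sum of the squared moduli of each of these terms. We then have
$$ |\langle\delta A\,\delta B\rangle|^2 = \frac{1}{4}\left(|\langle\{\delta A, \delta B\}\rangle|^2 + |\langle[\delta A, \delta B]\rangle|^2\right). \tag{4.22} $$
This statement leads to the strongest form of the inequality, but also implies the weaker statement
$$ |\langle\delta A\,\delta B\rangle|^2 \geq \frac{1}{4}|\langle[\delta A, \delta B]\rangle|^2. \tag{4.23} $$
Now, note that
$$ [\delta A, \delta B] = [A - \langle A\rangle, B - \langle B\rangle] = [A, B], \tag{4.24} $$
so we have
$$ |\langle\delta A\,\delta B\rangle|^2 \geq \frac{1}{4}|\langle[A, B]\rangle|^2. \tag{4.25} $$
Returning to Eq. (4.18), we then see that
$$ \Delta A^2\,\Delta B^2 \geq |\langle\delta A\,\delta B\rangle|^2 \geq \frac{1}{4}|\langle[A, B]\rangle|^2, \tag{4.26} $$
which completes the proof.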
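
As a sanity check, the inequality can be tested numerically for $A = S^x$, $B = S^y$ on a grid of spin-1/2 states. This is only a sketch: it assumes units with $\hbar = 1$, the standard Pauli-matrix representation, and the standard commutator $[S^x, S^y] = i\hbar S^z$ (not derived in these notes); the helper names are illustrative.

```python
import cmath
import math

HBAR = 1.0  # assumption: units in which hbar = 1
SX = [[0, HBAR / 2], [HBAR / 2, 0]]
SY = [[0, -1j * HBAR / 2], [1j * HBAR / 2, 0]]
SZ = [[HBAR / 2, 0], [0, -HBAR / 2]]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def expect(op, psi):
    """Expectation value of op in the normalized two-component ket psi."""
    op_psi = [sum(op[i][k] * psi[k] for k in range(2)) for i in range(2)]
    return sum(psi[i].conjugate() * op_psi[i] for i in range(2))


def spread(op, psi):
    """Standard deviation sqrt(E[op^2] - E[op]^2)."""
    var = expect(matmul(op, op), psi).real - expect(op, psi).real ** 2
    return math.sqrt(max(var, 0.0))


def state(theta, phi):
    """Normalized ket cos(theta/2)|+) + e^{i phi} sin(theta/2)|-)."""
    return [complex(math.cos(theta / 2)), cmath.exp(1j * phi) * math.sin(theta / 2)]


def both_sides(theta, phi):
    psi = state(theta, phi)
    lhs = spread(SX, psi) * spread(SY, psi)
    # [S^x, S^y] = i hbar S^z, so the bound is (hbar / 2) |E[S^z]|.
    rhs = 0.5 * HBAR * abs(expect(SZ, psi))
    return lhs, rhs


results = [both_sides(0.1 * i, 0.37 * j) for i in range(32) for j in range(17)]
all_satisfied = all(lhs >= rhs - 1e-12 for lhs, rhs in results)
print(all_satisfied)
```

The bound is saturated for some states (for example $\theta = 0$) and strict for others, consistent with the remark that Eq. (4.14) is not the strongest form of the relation.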


4.4 Position and Momentum 


We will now move on to study observables with continuous eigenvalues. In order to do so, we
need to work with infinite-dimensional Hilbert spaces. Consider any Hermitian operator $\xi$ with a
continuous spectrum
$$ \xi|\xi'\rangle = \xi'|\xi'\rangle, \qquad \xi' \in \mathbb{R}. \tag{4.27} $$
How should we regard the overlap of two states, $\langle\xi'|\xi''\rangle$? For a discrete spectrum, we know
that distinct normalized eigenkets of a Hermitian operator are orthonormal.
We want to appropriately generalize this statement. For a discrete spectrum, the orthonormality
condition is stated as
$$ \langle a|a'\rangle = \delta_{aa'}. \tag{4.28} $$
In the continuous case, we generalize this to
$$ \langle\xi'|\xi''\rangle = \delta(\xi' - \xi''), \tag{4.29} $$
where the Dirac delta "function" $\delta(x)$ is defined to be the object satisfying
$$ \int_{-\infty}^{\infty} dx\,\delta(x) = 1 \tag{4.30} $$
with $\delta(x) = 0$ for all $x \neq 0$. This has the property
$$ \int_{-\infty}^{\infty} dx\,\delta(x) f(x) = f(0). \tag{4.31} $$
We can think of this as the limit of a sequence of increasingly peaked functions centered at x = 0,
each with total area 1.
We will also generalize the completeness relation
$$ 1 = \sum_a |a\rangle\langle a| \tag{4.32} $$
that is familiar from the discrete spectrum case. For continuous spectra, the equivalent statement
is
$$ 1 = \int_{-\infty}^{\infty} d\xi'\,|\xi'\rangle\langle\xi'|. \tag{4.33} $$
This is called resolution of the identity. For an arbitrary state $|\psi\rangle$, we then have
$$ |\psi\rangle = \int_{-\infty}^{\infty} d\xi'\,|\xi'\rangle\langle\xi'|\psi\rangle. \tag{4.34} $$
We can express an arbitrary inner product as
$$ \langle\psi_1|\psi_2\rangle = \int_{-\infty}^{\infty} d\xi'\,\langle\psi_1|\xi'\rangle\langle\xi'|\psi_2\rangle. \tag{4.35} $$




Lecture 5 (Sep. 20, 2017) 


5.1 The Position Operator 


In the last class, we talked about operators with a continuous spectrum. A prime example is the
position operator. Let's first consider a particle in d = 1. We define x as the position operator,
with corresponding eigenstates $|x'\rangle$ satisfying
$$ x|x'\rangle = x'|x'\rangle. \tag{5.1} $$
Here, we are using the Sakurai notation, where operators are symbols with no hats or primes, and
the eigenstates are named by their eigenvalues.

We postulate that the $|x'\rangle$ form a complete set of states for the Hilbert space. This means that
we can expand any state $|\psi\rangle$ as
$$ |\psi\rangle = \int dx'\,|x'\rangle\langle x'|\psi\rangle. \tag{5.2} $$
In particular, the identity can be resolved as
$$ 1 = \int dx'\,|x'\rangle\langle x'|. \tag{5.3} $$


5.1.1 Measurement of Position 


Imagine an experiment in which we measure the position of a particle, which could come in the 
form of a detector that registers a click when a particle enters the detector. Any physical detector 
will have some finite resolution, below which we cannot resolve the precise location of the particle. 
Assume we have a detector that detects a particle when it is between $x' - \frac{\Delta}{2}$ and $x' + \frac{\Delta}{2}$.

We will now modify our "collapse" postulate from earlier in the course. After measurement,
the state of the particle will be such that a second measurement of x will yield a value between
$x' - \frac{\Delta}{2}$ and $x' + \frac{\Delta}{2}$. Mathematically, this means that an initial state
$$ |\psi\rangle = \int_{-\infty}^{\infty} dx''\,|x''\rangle\langle x''|\psi\rangle \tag{5.4} $$
is sent by the measurement to a state (up to normalization)
$$ |\psi\rangle \to \int_{x' - \Delta/2}^{x' + \Delta/2} dx''\,|x''\rangle\langle x''|\psi\rangle. \tag{5.5} $$
We assume that the resolution $\Delta$ is small by comparison with the length scale of variation of
$\langle x''|\psi\rangle$. Then, we can approximate this function to be constant over the range $[x' - \frac{\Delta}{2}, x' + \frac{\Delta}{2}]$, in
which case we find
$$ \mathrm{Prob}(\text{detection}) = |\langle x'|\psi\rangle|^2\,\Delta. \tag{5.6} $$
Taking the resolution to be infinitesimal, $\Delta \to dx'$, this statement becomes
$$ \mathrm{Prob}(\text{detection}) = |\langle x'|\psi\rangle|^2\,dx'. \tag{5.7} $$
This is the expected statement: the probability density at a point is the squared modulus of the
wavefunction at that point. The total probability that the particle exists is
$$ \mathrm{Prob}(\text{particle is somewhere}) = \int_{-\infty}^{\infty} dx'\,|\langle x'|\psi\rangle|^2 = 1, \tag{5.8} $$
which is true if $\langle\psi|\psi\rangle = 1$. The position-space wavefunction is defined as
$$ \langle x'|\psi\rangle := \psi(x'). \tag{5.9} $$




5.1.2 Hilbert Spaces 


We now make a brief aside about the actual definition of a Hilbert space. We generalize our notion 
of a vector space to the infinite-dimensional case. Consider any sequence of states in the space 
that has the property that for any € > 0, if we look sufficiently far along in the sequence, any two 
states in the sequence after that point will be closer together than € under the metric induced by 
the inner product. The additional property beyond those of a vector space that a Hilbert space 
has, called completeness, is that all such sequences must converge to another state in the space. In 
practice, this means that we can approximate any given state arbitrarily well by a series expansion. 
For more information on Hilbert spaces, see the handwritten notes or the notes from Recitation 1. 


5.1.3 Generalizing to Particles in Dimension d 


So far we have talked about particles moving in only one dimension. We want to generalize this to 
particles moving in d dimensions for arbitrary d. In this case, there are d position operators, 


$$ \vec{x} = (x_1, \ldots, x_d). \tag{5.10} $$

We assert that the $\{x_i\}$ are a set of compatible observables, meaning that we can simultaneously
measure each of them. This allows us to know the position of a particle along all axes at the same 
time. (It is interesting to consider what happens if we relax this assumption, for example, if we 
have a particle in two dimensions where the x and y operators do not commute. We will discuss 
such a system later in the course.) 
We can then find a complete set of states that are simultaneous eigenstates of each of the 
position operators, which we label as 
$$ \vec{x}\,|\vec{x}'\rangle = \vec{x}'\,|\vec{x}'\rangle, \tag{5.11} $$


and then we proceed as in the d = 1 case. 


5.2 The Momentum Operator 


In the position basis, the momentum operator is given by the familiar expression
$$ p = -i\hbar\frac{d}{dx}. \tag{5.12} $$
On the space of square-integrable functions $\psi(x)$, this operator acts as
$$ p\,\psi(x) = -i\hbar\frac{d\psi}{dx}. \tag{5.13} $$
From this, we see that
$$ [x, p]\,\psi = -i\hbar\left(x\frac{d\psi}{dx} - \frac{d}{dx}(x\psi)\right) = i\hbar\,\psi(x), \tag{5.14} $$
from which we conclude that
$$ [x, p] = i\hbar. \tag{5.15} $$
Applying the generalized uncertainty principle,
$$ \Delta A\,\Delta B \geq \frac{1}{2}\left|\langle[A, B]\rangle\right|, \tag{5.16} $$
we find the Heisenberg uncertainty principle,
$$ \Delta x\,\Delta p \geq \frac{\hbar}{2}. \tag{5.17} $$




5.2.1 Momentum Basis 


The momentum operator is Hermitian, and so we can find a complete set of eigenstates of the 
momentum operator. The momentum eigenstates $|p'\rangle$ satisfy
$$ p|p'\rangle = p'|p'\rangle, \tag{5.18} $$
and they form an orthonormal basis with
$$ \langle p''|p'\rangle = \delta(p'' - p'). \tag{5.19} $$
The identity can then be resolved as
$$ 1 = \int dp'\,|p'\rangle\langle p'|, \tag{5.20} $$
which allows us to decompose an arbitrary state $|\psi\rangle$ as
$$ |\psi\rangle = \int dp'\,|p'\rangle\langle p'|\psi\rangle. \tag{5.21} $$
Analogously to the discussion of measurement of position, measurements of momentum will
have
$$ \mathrm{Prob}(\text{momentum is between } p' \text{ and } p' + dp') = |\langle p'|\psi\rangle|^2\,dp'. \tag{5.22} $$
We similarly define the momentum-space wavefunction as
$$ \langle p'|\psi\rangle := \tilde{\psi}(p'). \tag{5.23} $$
Although we will rarely use this notation, it is worth noting that Sakurai denotes the
momentum-space wavefunction as
$$ \langle p'|\psi\rangle := \phi(p'). \tag{5.24} $$


How are $|p'\rangle$ and $|x'\rangle$ related? We need to find $\langle x'|p'\rangle$. We can determine this by inserting a
momentum operator:
$$ -i\hbar\frac{d}{dx'}\langle x'|p'\rangle = \langle x'|p|p'\rangle = p'\langle x'|p'\rangle. \tag{5.25} $$
The expression on the left-hand side is found by using the expression for the momentum operator
in the position basis, since $\langle x'|p'\rangle$ is the position-space wavefunction for the state $|p'\rangle$, while the
expression on the right-hand side is found by having the momentum operator act on the ket $|p'\rangle$.
The solution to this differential equation is
$$ \langle x'|p'\rangle = N e^{ip'x'/\hbar} \tag{5.26} $$
for some normalization factor N.
To fix the normalization, we use
$$ \langle x'|x''\rangle = \delta(x' - x''), \tag{5.27} $$
which implies
$$ \int dp'\,\langle x'|p'\rangle\langle p'|x''\rangle = \delta(x' - x''). \tag{5.28} $$
The left-hand side of this equation is
$$ \int dp'\,\langle x'|p'\rangle\langle p'|x''\rangle = |N|^2\int dp'\,e^{ip'(x' - x'')/\hbar} = 2\pi\hbar|N|^2\,\delta(x' - x''), \tag{5.29} $$
which must equal the right-hand side,
$$ 2\pi\hbar|N|^2\,\delta(x' - x'') = \delta(x' - x''). \tag{5.30} $$
This only fixes the modulus of N, but by convention we choose N to be real and positive, so that
we have
$$ N = \frac{1}{\sqrt{2\pi\hbar}}. \tag{5.31} $$
This finally gives us
$$ \langle x'|p'\rangle = \frac{1}{\sqrt{2\pi\hbar}}\,e^{ip'x'/\hbar}. \tag{5.32} $$
This allows us to relate the wavefunctions $\langle x'|\psi\rangle$ and $\langle p'|\psi\rangle$ in the position and momentum bases.
By inserting the identity, we can write
$$ \langle x'|\psi\rangle = \int dp'\,\langle x'|p'\rangle\langle p'|\psi\rangle = \int\frac{dp'}{\sqrt{2\pi\hbar}}\,e^{ip'x'/\hbar}\,\langle p'|\psi\rangle. \tag{5.33} $$
Thus, we see that the momentum-space wavefunction is related to the position-space wavefunction
by the Fourier transform. We similarly have
$$ \langle p'|\psi\rangle = \int\frac{dx'}{\sqrt{2\pi\hbar}}\,e^{-ip'x'/\hbar}\,\langle x'|\psi\rangle. \tag{5.34} $$

5.3 Normalization of Position and Momentum Eigenstates


What is the wavefunction corresponding to the position eigenstate $|x'\rangle$? We have
$$ \langle x''|x'\rangle = \delta(x'' - x'). \tag{5.35} $$


This is not a square-integrable function, and so it is not actually an element of the Hilbert space, 
but we can nonetheless build all square-integrable functions out of the position states. Similarly, 
the momentum eigenstates are not elements of the Hilbert space, but can be used to decompose 
any square-integrable function. 

How can we make this mathematically precise? We do this by changing the problem, and stating 
that our actual problem of interest is well-approximated by the new problem. The states |x’) and 
|p’) are not normalizable, and so we will change the problem to introduce normalizable states. We 
do this by assuming that our particle is moving in some finite-size space, and then take the finite 
size to infinity at the end of the problem. 

For example, we can define momentum states by putting our particle in a large box of size L 
with periodic boundary conditions, 


$$ e^{ip'(x' + L)/\hbar} = e^{ip'x'/\hbar}. \tag{5.36} $$
This implies that
$$ p'L = 2\pi\hbar n, \qquad n \in \mathbb{Z}. \tag{5.37} $$
We now have discrete momenta, and a countable basis of momentum eigenstates. For finite L, we
require that
$$ \int_0^L dx'\,|\langle x'|p'\rangle|^2 = 1, \tag{5.38} $$
which normalizes the momentum eigenstates:
$$ \langle x'|p'\rangle = \frac{1}{\sqrt{L}}\,e^{ip'x'/\hbar}. \tag{5.39} $$


5.4 The Uncertainty Principle 


The uncertainty relation tells us that if the position-space wavefunction $\psi(x')$ has a shape with
some width $\ell$, then the momentum-space wavefunction $\tilde{\psi}(p')$ will have a shape with some width of
order $\hbar/\ell$, as seen in Figure 1.

[Figure 1 not reproduced: sketches of $\psi(x')$, with width $\ell$, and $\tilde{\psi}(p')$, with width $\hbar/\ell$.]

Figure 1: Example position-space and momentum-space wavefunctions that are related by the
Fourier transform. The uncertainty relation tells us that their widths will be roughly reciprocal to
one another.


5.4.1 Minimum Uncertainty States 
For what wavefunction is the uncertainty relation $\Delta x\,\Delta p \geq \frac{\hbar}{2}$ saturated? The answer is a simple
Gaussian,
$$ \psi(x') = A\,e^{-x'^2/2w^2}. \tag{5.40} $$
You will prove this in the homework.
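
Before doing the homework proof, the claim can be checked numerically. The sketch below assumes units with $\hbar = 1$, a Gaussian of the form $e^{-x^2/2w^2}$ with a hypothetical width `W` (the product $\Delta x\,\Delta p$ does not depend on the width convention), and simple Riemann sums for the integrals.

```python
import math

HBAR = 1.0  # assumption: units in which hbar = 1
W = 0.7     # hypothetical width parameter w


def psi(x):
    """Unnormalized Gaussian exp(-x^2 / (2 w^2))."""
    return math.exp(-x * x / (2 * W * W))


def dpsi(x):
    """Analytic derivative of psi."""
    return -x / (W * W) * psi(x)


# Riemann sum on a grid wide enough that psi is negligible at the ends.
N = 20001
X_MAX = 12 * W
dx = 2 * X_MAX / (N - 1)
xs = [-X_MAX + i * dx for i in range(N)]

norm = sum(psi(x) ** 2 for x in xs) * dx
mean_x2 = sum(x * x * psi(x) ** 2 for x in xs) * dx / norm
# The means of x and p vanish by symmetry for this real, even wavefunction.
# The mean of p^2 is hbar^2 * integral |psi'|^2 dx / norm (integration by parts).
mean_p2 = HBAR ** 2 * sum(dpsi(x) ** 2 for x in xs) * dx / norm

delta_x = math.sqrt(mean_x2)
delta_p = math.sqrt(mean_p2)
product = delta_x * delta_p
print(delta_x, delta_p, product)
```

The product agrees with $\hbar/2$ to within the accuracy of the numerical integration, and changing `W` trades width in position for width in momentum without changing the product.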
5.5 Momentum and Translation 
We will now use the momentum operator p to define a unitary operator
$$ T(a) = e^{-ipa/\hbar}, \qquad a \in \mathbb{R}. \tag{5.41} $$
This operator is unitary because p is Hermitian:
$$ (T(a))^\dagger = e^{ipa/\hbar} = (T(a))^{-1}. \tag{5.42} $$
The operator exponential used here can be defined using the series expansion,
$$ T(a) = e^{-ipa/\hbar} = \sum_{n=0}^{\infty}\frac{1}{n!}\left(-\frac{ipa}{\hbar}\right)^n. \tag{5.43} $$
Note that this definition can only make sense when acting on a Hilbert space.
We can see that this operator satisfies the following properties:


1. $T^{-1}(a) = T(-a)$

2. $T(a')\,T(a'') = T(a' + a'')$

3. $T(a)\,x\,T^{-1}(a) = x - a$

Let's prove the final property.


Proof. Define
$$ F(a) = T(a)\,x\,T^{-1}(a) = e^{-ipa/\hbar}\,x\,e^{ipa/\hbar}. \tag{5.44} $$
The trick to the proof will be to compute the derivative of this quantity with respect to a:
$$ \frac{dF}{da} = \frac{i}{\hbar}\left(e^{-ipa/\hbar}(-px)\,e^{ipa/\hbar} + e^{-ipa/\hbar}(xp)\,e^{ipa/\hbar}\right) = \frac{i}{\hbar}\,e^{-ipa/\hbar}[x, p]\,e^{ipa/\hbar} = \frac{i}{\hbar}(i\hbar) = -1. \tag{5.45} $$
We can then integrate this equation to find
$$ F(a) = F(0) - a. \tag{5.46} $$
Because F(0) = x, this completes the proof.


For this reason, we will call T(a) the translation operator. What happens to a state $|\psi\rangle$ under
the action of this operator? We know that
$$ T(a)\,x\,T^{-1}(a)|x'\rangle = (x - a)|x'\rangle = (x' - a)|x'\rangle. \tag{5.47} $$
By applying $T^{-1}(a)$ to each side of this equation, we find
$$ x\,T^{-1}(a)|x'\rangle = (x' - a)\,T^{-1}(a)|x'\rangle, \tag{5.48} $$
from which we conclude that
$$ T^{-1}(a)|x'\rangle \propto |x' - a\rangle. \tag{5.49} $$
This operator does not change the modulus of a state because it is unitary, so we actually have
$$ T^{-1}(a)|x'\rangle = |x' - a\rangle. \tag{5.50} $$
The effect on the wavefunction is thus
$$ \langle x'|T(a)|\psi\rangle = \left(T^\dagger(a)|x'\rangle, |\psi\rangle\right) = \langle x' - a|\psi\rangle = \psi(x' - a). \tag{5.51} $$




We see that $T(a)|\psi\rangle$ has the shifted wavefunction $\psi(x' - a)$ if $|\psi\rangle$ has the wavefunction $\psi(x')$. For
a > 0, this wavefunction is shifted to the right by a.
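
The shift can be seen concretely from the series definition of the operator exponential. In the position basis $p = -i\hbar\,d/dx$, so $-ipa/\hbar = -a\,d/dx$ and the series becomes $\sum_n \frac{(-a)^n}{n!}\frac{d^n}{dx^n}$. The sketch below applies this series to a polynomial, for which it terminates exactly. A polynomial is not a normalizable wavefunction; it is used here only as an illustration, and the coefficients and helper names are hypothetical.

```python
import math


def derivative(coeffs):
    """Derivative of a polynomial given as [c0, c1, c2, ...]."""
    return [k * c for k, c in enumerate(coeffs)][1:]


def evaluate(coeffs, x):
    return sum(c * x ** k for k, c in enumerate(coeffs))


def translate(coeffs, a):
    """Apply sum_n (-a)^n / n! d^n/dx^n to the polynomial."""
    result = [0.0] * len(coeffs)
    term = list(coeffs)
    n = 0
    while term:  # the n-th derivative becomes empty once n exceeds the degree
        factor = (-a) ** n / math.factorial(n)
        for k, c in enumerate(term):
            result[k] += factor * c
        term = derivative(term)
        n += 1
    return result


poly = [1.0, -2.0, 0.5, 3.0]  # hypothetical psi(x) = 1 - 2x + 0.5x^2 + 3x^3
a = 0.8
shifted = translate(poly, a)

# Compare (T(a) psi)(x) with psi(x - a) at a few sample points.
sample_points = [-2.0, -0.5, 0.0, 1.3, 2.7]
max_error = max(abs(evaluate(shifted, x) - evaluate(poly, x - a))
                for x in sample_points)
print(max_error)
```

The two evaluations agree to floating-point accuracy, which is just Taylor's theorem read as a statement about the operator $e^{-ipa/\hbar}$.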

We have now seen that this unitary translation operator, which is a natural operator to define 
acting on the Hilbert space, is determined entirely by the Hermitian operator p. We could have
turned this story around, by first defining the unitary translation operator and then using it to
define a Hermitian operator p. In the classical limit, the operator p found in this way has the right
properties to be called a momentum. Note that any unitary operator U can be written as $e^{iA}$ for
some Hermitian operator A. This is the approach we will take when we discuss angular momentum 
later in the course: it will be defined to be the operator that, when exponentiated, rotates the 
quantum mechanical system. 




Lecture 6 (Sep. 25, 2017) 


6.1 Solving Problems in Convenient Bases 


Last time we talked about the momentum and position bases. From a practical point of view, it 
often pays off to not solve a particular problem in the position basis, and instead choose a more 
convenient basis, such as the momentum basis. Suppose we have a Hamiltonian
$$ H = -\frac{\hbar^2}{2m}\frac{d^2}{dx^2} + \alpha x. \tag{6.1} $$
We could solve this in the position basis; the eigenvalue equation is a second-order differential
equation, with solutions called Airy functions. However, this is easy to solve in the momentum basis.
In the momentum basis, the Hamiltonian becomes
$$ H = \frac{p^2}{2m} + i\alpha\hbar\frac{d}{dp}, \tag{6.2} $$
for which the eigenvalue equation is only a first-order differential equation.
For other problems, the most convenient basis may not be the position or the momentum basis, 
but rather some mixed basis. It is worth becoming comfortable with change of basis in order to 
simplify problems. 


6.2 Quantum Dynamics


Recall the fourth postulate: time evolution is a map
$$ |\psi(t)\rangle \to |\psi(t')\rangle = U(t', t)|\psi(t)\rangle, \tag{6.3} $$
where U(t', t) is the time-evolution operator, which is unitary. In the case where t' and t differ by
an infinitesimal amount, we can rewrite time evolution as a differential equation,
$$ i\hbar\frac{d}{dt}|\psi(t)\rangle = H(t)|\psi(t)\rangle, \tag{6.4} $$
where H(t) is the Hamiltonian, which is a Hermitian operator.
In general, we can always write
$$ |\psi(t')\rangle = U(t', t)|\psi(t)\rangle \tag{6.5} $$
for some operator U(t', t). What statements can we make about the operator U(t', t)? We can say
the following: 


1. The probability must be conserved. We start with a state for which the total probability of 
all outcomes of a measurement is 1, and so we expect that after evolving in time, we should 
arrive at a state that also has total probability 1. This implies that U must be unitary, which 
we will now prove. 


Proof. Let us expand the state $|\psi(t)\rangle$ as
$$ |\psi(t)\rangle = \sum_n c_n(t)|a_n\rangle, \tag{6.6} $$
with $\{|a_n\rangle\}$ a basis of eigenstates of any observable A. The probability of a measurement of
A yielding the result $a_n$ at time t is given by
$$ \mathrm{Prob}(A = a_n \text{ at time } t) := p(a_n; t) = |c_n(t)|^2, \tag{6.7} $$
where we have introduced the shorthand notation $p(a_n; t)$ for this probability. The total
probability must be 1,
$$ \sum_n p(a_n; t) = 1, \tag{6.8} $$
which is simply the statement that the state $|\psi(t)\rangle$ must be normalized,
$$ \sum_n |c_n(t)|^2 = 1. \tag{6.9} $$
At a later time t', we have
$$ p(a_n; t') = |c_n(t')|^2, \tag{6.10} $$
and once again, requiring that the total probability be 1 requires that $|\psi(t')\rangle$ be normalized,
$$ \sum_n |c_n(t')|^2 = 1. \tag{6.11} $$
Thus, we must have
$$ \langle\psi(t')|\psi(t')\rangle = \langle\psi(t)|U^\dagger(t', t)\,U(t', t)|\psi(t)\rangle = 1 \tag{6.12} $$
for arbitrary normalized $|\psi(t)\rangle$. This is satisfied if and only if $U^\dagger(t', t)\,U(t', t) = 1$, i.e., if U is
unitary.


2. We demand that the time-evolution operator satisfy composition, i.e.,
$$ U(t_3, t_1) = U(t_3, t_2)\,U(t_2, t_1) \tag{6.13} $$
for all $t_3 > t_2 > t_1$.


3. We require that
$$ U(t, t) = 1. \tag{6.14} $$


We will now build up an arbitrary time-evolution operator as a sequence of infinitesimal
time-evolution operators. Consider a time evolution U(t + dt, t); we can expand this as a power series
in the infinitesimal dt, yielding
$$ U(t + dt, t) = 1 - \frac{i}{\hbar}H\,dt + O((dt)^2) \tag{6.15} $$
for some operator H (which we do not yet know anything about). The factors of i and $\hbar$ are put in
for historical reasons, but note that we can always expand U(t + dt, t) in this way. We will neglect
higher-order terms because dt is infinitesimal. We then have
$$ U^\dagger(t + dt, t) = 1 + \frac{i}{\hbar}H^\dagger\,dt + O((dt)^2). \tag{6.16} $$




Requiring unitarity, $U^\dagger U = 1$, then gives us
$$ U^\dagger U = \left(1 + \frac{i}{\hbar}H^\dagger\,dt + \cdots\right)\left(1 - \frac{i}{\hbar}H\,dt + \cdots\right) = 1 + \frac{i}{\hbar}\left(H^\dagger - H\right)dt + O((dt)^2) = 1. \tag{6.17} $$
We conclude that we must have $H^\dagger = H$, i.e., H must be Hermitian. The statement that U
is a unitary operator is equivalent to the statement that H is Hermitian. We will call H the
Hamiltonian. Shortly, we will make the connection between this operator and the Hamiltonian we
know from classical physics.
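
The order-by-order argument can be illustrated numerically: for a Hermitian H the defect $U^\dagger U - 1$ of the truncated step $U = 1 - \frac{i}{\hbar}H\,dt$ shrinks like $(dt)^2$, while for a non-Hermitian H it shrinks only like $dt$. The sketch below assumes units with $\hbar = 1$; the two 2x2 matrices and all helper names are hypothetical examples.

```python
HBAR = 1.0  # assumption: units in which hbar = 1


def dagger(m):
    return [[complex(m[j][i]).conjugate() for j in range(2)] for i in range(2)]


def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def infinitesimal_step(h, dt):
    """U = 1 - (i / hbar) H dt, dropping O(dt^2) terms."""
    return [[(1 if i == j else 0) - 1j * h[i][j] * dt / HBAR for j in range(2)]
            for i in range(2)]


def unitarity_defect(h, dt):
    """Largest entry (in modulus) of U^dagger U - 1."""
    u = infinitesimal_step(h, dt)
    prod = matmul(dagger(u), u)
    return max(abs(prod[i][j] - (1 if i == j else 0))
               for i in range(2) for j in range(2))


H_HERMITIAN = [[1.0, 0.3 - 0.4j], [0.3 + 0.4j, -0.5]]
H_NOT_HERMITIAN = [[1.0 + 0.2j, 0.3 - 0.4j], [0.1, -0.5]]

# Shrinking dt by 10 shrinks the defect by about 100 (second order) if H is
# Hermitian, but only by about 10 (first order) if it is not.
ratio_hermitian = (unitarity_defect(H_HERMITIAN, 1e-2)
                   / unitarity_defect(H_HERMITIAN, 1e-3))
ratio_not_hermitian = (unitarity_defect(H_NOT_HERMITIAN, 1e-2)
                       / unitarity_defect(H_NOT_HERMITIAN, 1e-3))
print(ratio_hermitian, ratio_not_hermitian)
```

The remaining second-order defect in the Hermitian case is what the neglected $O((dt)^2)$ terms of the full exponential cancel.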

Applying the infinitesimal time-evolution operator to a state, we find 


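$$ |\psi(t + dt)\rangle = \left(1 - \frac{i}{\hbar}H\,dt + O((dt)^2)\right)|\psi(t)\rangle. \tag{6.18} $$
We can write this statement as
$$ i\hbar\frac{d}{dt}|\psi(t)\rangle = H|\psi(t)\rangle, \tag{6.19} $$
which we recognize as the Schrödinger equation.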

Now let’s connect this to the discussion of unitary spatial translations. We can think of time 
evolution as translation in time. At the end of the last lecture, we alluded to the fact that an 
infinitesimal spatial translation is generated by the Hermitian momentum operator. This is exactly 
analogous to the statement that infinitesimal time evolutions are generated by the Hamiltonian, 
which is Hermitian. (Note that momentum is the conserved Noether charge associated to spatial 
translations, and energy is the conserved Noether charge associated to time translations.) 


6.2.1 Brief Aside on Classical Mechanics 


In classical mechanics, time evolution is dictated by Newton’s laws. Over the centuries, Newton’s 
laws have been massaged into various forms. Classically, the state of a particle is specified by its 
position q(t) and momentum p(t). Given such a state, how do q(t), p(t) change as a function of 
time? This is described by the equations of motion for the particle, which are known as Hamilton’s 


equations: 
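$$ \frac{dq}{dt} = \frac{\partial H}{\partial p}, \qquad \frac{dp}{dt} = -\frac{\partial H}{\partial q}, \tag{6.20} $$
where H = H(q, p) is the classical Hamiltonian.
As an example, consider
$$ H(q, p) = \frac{p^2}{2m} + V(q). \tag{6.21} $$
Hamilton's equations then give
$$ \dot{q} = \frac{p}{m}, \qquad \dot{p} = -\nabla V, \tag{6.22} $$
which are simply Newton's equations of motion.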
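
As an illustration of Hamilton's equations for $H = p^2/2m + V(q)$, the sketch below integrates them numerically for a harmonic potential $V(q) = \frac{1}{2}kq^2$ and compares with the known solution $q(t) = \cos t$ for $m = k = 1$. The choice of potential, the leapfrog integrator, and all names are assumptions made for this example; none of them come from the lecture.

```python
import math

M = 1.0  # hypothetical mass
K = 1.0  # hypothetical spring constant, V(q) = K q^2 / 2


def dV(q):
    """Derivative of the potential, dV/dq."""
    return K * q


def leapfrog(q, p, dt, steps):
    """Integrate dq/dt = p / M, dp/dt = -dV/dq with the leapfrog scheme."""
    for _ in range(steps):
        p -= 0.5 * dt * dV(q)  # half step in p using dp/dt = -dH/dq
        q += dt * p / M        # full step in q using dq/dt = dH/dp
        p -= 0.5 * dt * dV(q)  # second half step in p
    return q, p


# Start at rest at q = 1 and evolve to t = 1.
q_final, p_final = leapfrog(1.0, 0.0, 1e-3, 1000)
q_error = abs(q_final - math.cos(1.0))
p_error = abs(p_final + math.sin(1.0))
print(q_final, p_final, q_error, p_error)
```

Leapfrog is chosen here because each of its sub-steps is an area-preserving map of phase space, which mirrors the structure of Hamiltonian time evolution.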

The space of (q, p) is known as phase space. The classical time evolution given by Hamilton's
equations is such that the area of a region of phase space is preserved by the time evolution
(Liouville's theorem). Suppose we have a collection of particles, each with different values of q, p. This
collection fills out a region in phase space. If we evolve each particle in time, the shape of the region
in phase space will change, but its area will be conserved.

We can state Liouville’s theorem formally in terms of the Poisson bracket, defined as 


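$$ \{f, g\} = \frac{\partial f}{\partial q}\frac{\partial g}{\partial p} - \frac{\partial f}{\partial p}\frac{\partial g}{\partial q}. \tag{6.23} $$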




Any observable A (that only depends on time through q and p) evolves in time according to
$$ \frac{dA}{dt} = \{A, H\}. \tag{6.24} $$
In particular, taking A = q yields
$$ \frac{dq}{dt} = \{q, H\} = \frac{\partial H}{\partial p}, \tag{6.25} $$
and taking A = p yields
$$ \frac{dp}{dt} = \{p, H\} = -\frac{\partial H}{\partial q}. \tag{6.26} $$


Classically, the Hamiltonian is (usually) the total energy of the system. In quantum mechanics, 
we refer to H as the Hamiltonian because it is the operator that returns the energy of the system. 
In quantum mechanics, the Hamiltonian (energy) operator and the time operator are more deeply 
related because of the time-energy uncertainty relation. 


6.2.2 The Heisenberg Picture 


Let's start with the Schrödinger equation
$$ i\hbar\frac{d}{dt}|\psi(t)\rangle = H|\psi(t)\rangle. \tag{6.27} $$
For a closed system, H is independent of time, and we can solve this equation exactly, in some
formal sense, as
$$ |\psi(t)\rangle = e^{-iHt/\hbar}|\psi(0)\rangle. \tag{6.28} $$
Thus, the time-evolution operator is given by
$$ U(t, 0) = e^{-iHt/\hbar}. \tag{6.29} $$
The physically significant quantities are matrix elements of operators,
$$ \langle\psi'(t)|A|\psi(t)\rangle = \langle\psi'(0)|U^\dagger(t, 0)\,A\,U(t, 0)|\psi(0)\rangle. \tag{6.30} $$
Here, we have assumed that A does not have explicit time dependence. Thus, the time dependence
of the matrix element comes only from the fact that the states change in time. However, we can
look at the expression on the right-hand side and define a time-dependent operator
$$ A_H(t) := U^\dagger(t, 0)\,A_H(0)\,U(t, 0), \qquad A_H(0) = A, \tag{6.31} $$
that acts on states $|\psi\rangle := |\psi(0)\rangle$ that do not evolve in time. Viewing time evolution in this
way is referred to as working in the Heisenberg picture. Up until now, we have been working in
the Schrödinger picture, in which the time dependence of matrix elements comes from the time
dependence of states $|\psi(t)\rangle$, and operators without explicit time dependence do not evolve in time.
By design, we have
$$ \langle\psi'|A_H(t)|\psi\rangle = \langle\psi'(0)|U^\dagger(t, 0)\,A\,U(t, 0)|\psi(0)\rangle. \tag{6.32} $$


We can write an infinitesimal version of Eq. (6.31) as
$$ \frac{dA_H}{dt} = \frac{d}{dt}\left(e^{iHt/\hbar}A_H(0)\,e^{-iHt/\hbar}\right) = \frac{i}{\hbar}\,e^{iHt/\hbar}\left(H A_H(0) - A_H(0) H\right)e^{-iHt/\hbar} = \frac{1}{i\hbar}[A_H(t), H]. \tag{6.33} $$
Comparing this with the classical time-evolution expression
$$ \frac{dA}{dt} = \{A, H\}, \tag{6.34} $$
we see that the commutator in quantum mechanics plays a similar role as the Poisson bracket in
classical mechanics.
6.3 Energy Eigenstates


We will now assume that the Hamiltonian H corresponds to the energy of the system. Because H 
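is Hermitian, we can diagonalize it with a unitary transformation. Let $|j\rangle$ be its eigenstates,
$$ H|j\rangle = E_j|j\rangle, \tag{6.35} $$
which form an orthonormal basis. We call the $|j\rangle$ energy eigenstates, and the $E_j$ the corresponding
energy eigenvalues. An arbitrary state $|\psi\rangle$ can be decomposed as
$$ |\psi\rangle = \sum_j c_j|j\rangle \tag{6.36} $$
with
$$ c_j = \langle j|\psi\rangle. \tag{6.37} $$
Similarly, an arbitrary operator A can be decomposed as
$$ A = \sum_{j,j'} A_{jj'}\,|j\rangle\langle j'|. \tag{6.38} $$
In the Schrödinger picture, $A_{jj'}$ is time-independent, and the states evolve as
$$ |\psi(0)\rangle \to |\psi(t)\rangle = e^{-iHt/\hbar}|\psi(0)\rangle = e^{-iHt/\hbar}\sum_j c_j|j\rangle = \sum_j e^{-iE_jt/\hbar}\,c_j|j\rangle. \tag{6.39} $$
Thus, we can write
$$ |\psi(t)\rangle = \sum_j c_j(t)|j\rangle, \tag{6.40} $$
where the coefficients $c_j(t)$ evolve in time as
$$ c_j(t) = e^{-iE_jt/\hbar}\,c_j. \tag{6.41} $$
The matrix element of A is then
$$ \langle\psi(t)|A|\psi(t)\rangle = \sum_{j,j'} c_j^*(t)\,c_{j'}(t)\,\langle j|A|j'\rangle = \sum_{j,j'} c_j^*\,c_{j'}\,e^{i(E_j - E_{j'})t/\hbar}A_{jj'}. \tag{6.42} $$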



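In the Heisenberg picture, the states are time-independent, so the $c_j$ are fixed, while A evolves
as
$$ A \to A(t) = e^{iHt/\hbar}A\,e^{-iHt/\hbar} = \sum_{j,j'} A_{jj'}\,e^{iHt/\hbar}|j\rangle\langle j'|e^{-iHt/\hbar} = \sum_{j,j'} A_{jj'}\,e^{i(E_j - E_{j'})t/\hbar}\,|j\rangle\langle j'|. \tag{6.43} $$
The matrix element is then
$$ \langle\psi|A(t)|\psi\rangle = \sum_{j,j'} c_j^*\,c_{j'}\,e^{i(E_j - E_{j'})t/\hbar}A_{jj'}, \tag{6.44} $$
which agrees with the result in the Schrödinger picture.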
6.4 Example: Spin Precession In a Magnetic Field 
Consider a spin-1/2 moment in a magnetic field, with Hamiltonian
$$ H = -\frac{ge}{2m}\,\vec{S}\cdot\vec{B}. \tag{6.45} $$
Here, e is the electric charge, m is the mass of the particle, and g is the g-factor, which has $g \approx 2$
for the electron. Assuming that $\vec{B} = B\hat{z}$, we can write the Hamiltonian as
$$ H = -\frac{ge}{2m}\,S^z B. \tag{6.46} $$
With this assumption, the $S^z$ eigenstates are also energy eigenstates,
$$ H|S^z_\pm\rangle = \mp\frac{ge\hbar B}{4m}\,|S^z_\pm\rangle, \tag{6.47} $$
with energy eigenvalues
$$ E_\pm = \mp\frac{ge\hbar B}{4m}. \tag{6.48} $$
The energy separation is
$$ \hbar\omega = |E_+ - E_-| = \frac{g|e|\hbar B}{2m}. \tag{6.49} $$




Lecture 7 (Sep. 27, 2017) 


7.1 Spin Precession In a Magnetic Field 


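Last time, we began discussing the classic example of precession of a spin-1/2 particle in a magnetic
field. The Hamiltonian of this system is
$$ H = -\frac{ge}{2m}\,\vec{S}\cdot\vec{B}. \tag{7.1} $$
With $\vec{B} = B\hat{z}$, this becomes
$$ H = -\frac{ge}{2m}\,S^z B. \tag{7.2} $$
The energy levels are
$$ E_\pm = \mp\frac{ge\hbar B}{4m}, \tag{7.3} $$
giving a level splitting of
$$ \hbar\omega = \frac{g|e|\hbar B}{2m}. \tag{7.4} $$
We define
$$ \omega = \frac{g|e|B}{2m} \tag{7.5} $$
so that, for a negatively charged particle such as the electron ($e = -|e|$), the Hamiltonian can be
written simply as
$$ H = \omega S^z. \tag{7.6} $$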


7.1.1 Schrödinger Picture


The time-evolution operator can be expressed in the energy eigenbasis (which coincides with the
$S^z$ eigenbasis) as
$$ U(t, 0) = e^{-iHt/\hbar} = e^{-i\omega S^z t/\hbar}. \tag{7.7} $$
Suppose that we have an arbitrary initial state
$$ |\psi\rangle = c_+|+\rangle + c_-|-\rangle. \tag{7.8} $$
Using the time-evolution operator, we find that
$$ |\psi(t)\rangle = U(t, 0)|\psi\rangle = e^{-i\omega S^z t/\hbar}|\psi\rangle = c_+\,e^{-i\omega t/2}|+\rangle + c_-\,e^{i\omega t/2}|-\rangle. \tag{7.9} $$
We now know the state of the system at all times. For example, if we initially have $|\psi\rangle = |+\rangle$, then
$$ |\psi(t)\rangle = e^{-i\omega t/2}|+\rangle, \tag{7.10} $$
which has
$$ \mathrm{Prob}\left(S^z = \tfrac{\hbar}{2}\right) = 1 \tag{7.11} $$
for all times. This is why energy eigenstates are often called stationary states. Because time
evolution is generated by the Hamiltonian, energy eigenstates do not change under time evolution
(up to an unphysical overall phase).




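Consider instead the case where the initial state is the spin-up eigenstate of $S^x$, i.e.,
$$ |\psi\rangle = \frac{1}{\sqrt{2}}\left(|+\rangle + |-\rangle\right). \tag{7.12} $$
Then we have
$$ |\psi(t)\rangle = \frac{1}{\sqrt{2}}\left(e^{-i\omega t/2}|+\rangle + e^{i\omega t/2}|-\rangle\right). \tag{7.13} $$
Thus, after a time t we have
$$ \mathrm{Prob}\left(S^x = \tfrac{\hbar}{2} \text{ at time } t\right) = |\langle S^x_+|\psi(t)\rangle|^2 = \frac{1}{4}\left|\left(\langle +| + \langle -|\right)\left(e^{-i\omega t/2}|+\rangle + e^{i\omega t/2}|-\rangle\right)\right|^2 = \cos^2\left(\frac{\omega t}{2}\right). \tag{7.14} $$
Similarly,
$$ \mathrm{Prob}\left(S^x = -\tfrac{\hbar}{2} \text{ at time } t\right) = \sin^2\left(\frac{\omega t}{2}\right). \tag{7.15} $$
We can check that this is true, but we know it must be true by conservation of probability. We
then have
$$ \langle S^x\rangle = \frac{\hbar}{2}\left[\cos^2\left(\frac{\omega t}{2}\right) - \sin^2\left(\frac{\omega t}{2}\right)\right] = \frac{\hbar}{2}\cos(\omega t). \tag{7.16} $$
We see that this oscillates as a function of time, with angular frequency $\omega$, as we expect from our
classical intuition. What can we learn from this calculation? If we have a general initial state
represented by $\hat{n}$ on the Bloch sphere, then $\hat{n}$ will precess around $\vec{B}$ with angular frequency $\omega$.
You will show this on the homework.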


7.1.2 Heisenberg Picture 


We can carry out this same calculation in the Heisenberg picture. In this picture, the states do
not change as a function of time, but rather the operators do. The spin operator in the Heisenberg
picture evolves according to the equation
$$ \vec{S}(t) = e^{i\omega S^z t/\hbar}\,\vec{S}\,e^{-i\omega S^z t/\hbar}. \tag{7.17} $$
Taking the time derivative of this expression, we find
$$ \frac{d\vec{S}}{dt} = \frac{i\omega}{\hbar}\,e^{i\omega S^z t/\hbar}\,[S^z, \vec{S}]\,e^{-i\omega S^z t/\hbar}. \tag{7.18} $$
The z-component of this equation of motion is simple, because $[S^z, S^z] = 0$, so we have
$$ \frac{dS^z}{dt} = 0. \tag{7.19} $$
The x-component is found using the fact that $[S^z, S^x] = i\hbar S^y$, which gives
$$ \frac{dS^x}{dt} = \frac{i\omega}{\hbar}\,(i\hbar)\,S^y(t) = -\omega S^y(t). \tag{7.20} $$
Similarly, using $[S^z, S^y] = -i\hbar S^x$, we find
$$ \frac{dS^y}{dt} = \omega S^x(t). \tag{7.21} $$
We can write these three expressions compactly in vector notation as
$$ \frac{d\vec{S}}{dt} = \omega\,\hat{z}\times\vec{S}. \tag{7.22} $$
This is the same equation as for classical spins, but the interpretation is entirely different because
$\vec{S}$ is now an operator. If we take the expectation value of each side of this expression, we will get
the same answers that we found in the Schrödinger picture.


7.2 Particle in a Potential 


We have said, on general grounds, that time evolution is given by a unitary operator. Furthermore, 
we have said that the infinitesimal form of this operator corresponds to a Hermitian operator, which 
we have called the Hamiltonian. However, we must ask: how do we specify a quantum mechanical 
system? There are two main ingredients we need: we have to first specify the Hilbert space that the 
states live in, and then we have to specify the Hamiltonian. Either of these is insufficient without 
the other. 

For a particle in a potential, the Hilbert space is the space of square-integrable functions (modulo 
magnitude and phase). In order to specify the Hamiltonian, we will simply take the classical 
Hamiltonian and replace the variables x and p by the corresponding operators x and p. We must 
be careful, because although the regular variables x and p commute, the corresponding operators 
do not. 

Suppose we have the Hamiltonian 


$$H = \frac{p^2}{2m} + V(x). \tag{7.23}$$


How do we proceed? In the Schrödinger picture, we first find the energy eigenkets $|j\rangle$ and eigenvalues $E_j$, which satisfy
$$H|j\rangle = E_j|j\rangle. \tag{7.24}$$
Once we have found these eigenkets, we can expand an arbitrary state as
$$|\psi\rangle = \sum_j c_j|j\rangle. \tag{7.25}$$
Time evolution is carried out as
$$|\psi(t)\rangle = \sum_j c_j\,e^{-iE_j t/\hbar}|j\rangle. \tag{7.26}$$
In principle, this is a complete procedure to determine the dynamics of any quantum system. 


However, in practice this can often be very difficult. 
In the Heisenberg picture, we simply care about the operators,
$$x_H(t) = e^{iHt/\hbar}\,x\,e^{-iHt/\hbar}, \qquad p_H(t) = e^{iHt/\hbar}\,p\,e^{-iHt/\hbar}. \tag{7.27}$$
Using the Heisenberg picture equation of motion, we see that the operator $x_H$ evolves according to
$$\frac{dx_H}{dt} = \frac{1}{i\hbar}[x_H, H] = \frac{1}{i\hbar}\Bigl[x_H, \frac{p_H^2}{2m} + V(x_H)\Bigr] = \frac{1}{i\hbar}\Bigl[x_H, \frac{p_H^2}{2m}\Bigr]. \tag{7.28}$$


From this point forward, we will drop the subscript $H$ on the Heisenberg picture operators. Using a commutator identity, we then have
$$\frac{dx}{dt} = \frac{1}{2im\hbar}\bigl([x,p]\,p + p\,[x,p]\bigr) = \frac{p}{m}, \tag{7.29}$$
$$\frac{dp}{dt} = \frac{1}{i\hbar}[p, V(x)] = \frac{1}{i\hbar}\Bigl(-i\hbar\frac{\partial}{\partial x}\bigl(V(x)\,\cdot\,\bigr) + V(x)\,i\hbar\frac{\partial}{\partial x}\,\cdot\,\Bigr) = -\frac{dV}{dx}. \tag{7.30}$$

Here, we have made clear the action of the derivative operator by using $\cdot$ to denote an arbitrary position-space wavefunction on which these operators could act. Once again, this resembles the classical expression, but is now an operator equation.

If we take the expectation values of Eqs. (7.29) and (7.30), we find
$$\frac{d\langle x\rangle}{dt} = \frac{\langle p\rangle}{m}, \qquad \frac{d\langle p\rangle}{dt} = -\Bigl\langle\frac{dV}{dx}\Bigr\rangle. \tag{7.31}$$
This result is called Ehrenfest's theorem. It is important, when using the second equation, to remember to differentiate the potential first before taking the expectation value, because reversing the order of these operations will often change the result.
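The warning about the order of operations in Ehrenfest's theorem can be made concrete with a small sketch. The quartic potential $V(x) = x^4$ and the two-point position distribution below are illustrative assumptions, chosen only so the arithmetic is transparent; they do not come from the lecture.

```python
def expectation(values, probs, func):
    """Average of func(x) over a discrete position distribution."""
    return sum(p * func(x) for x, p in zip(values, probs))


def dV(x):
    """Derivative of the illustrative potential V(x) = x**4."""
    return 4 * x ** 3


# Hypothetical position distribution: equal weight at x = 0 and x = 2.
positions = [0.0, 2.0]
probs = [0.5, 0.5]

mean_x = expectation(positions, probs, lambda x: x)

# Correct order: differentiate first, then average, giving -<dV/dx>.
force_correct = -expectation(positions, probs, dV)

# Reversed order: average first, then differentiate, giving -dV/dx at <x>.
force_reversed = -dV(mean_x)

print(mean_x, force_correct, force_reversed)
```

The two results differ (here by a factor of four). For a potential that is at most quadratic, $dV/dx$ is linear in $x$ and the two orders agree.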


7.2.1 Example: Charged Particle in a Uniform Electric Field 
Consider a charged particle in a uniform electric field, which has the Hamiltonian 


$$H = \frac{p^2}{2m} - qE(t)\,x. \tag{7.32}$$


In the Schrödinger picture, this is a messy problem to solve. In the Heisenberg picture, however, the problem is not difficult at all. Using Eq. (7.30), we have
$$\frac{dp}{dt} = qE(t). \tag{7.33}$$
This has solution
$$p(t) = p(0) + \int_0^t dt'\,qE(t'). \tag{7.34}$$
Similarly, using Eq. (7.29), we have
$$\frac{dx}{dt} = \frac{p(t)}{m}, \tag{7.35}$$
which yields
$$x(t) - x(0) = \frac{p(0)\,t}{m} + \frac{q}{m}\int_0^t dt'\int_0^{t'} dt''\,E(t''). \tag{7.36}$$
These are the same results we find classically. If $E(t) = E$ is independent of time, then we have
$$p(t) = p(0) + qEt, \qquad x(t) = x(0) + \frac{p(0)\,t}{m} + \frac{q}{2m}Et^2. \tag{7.37}$$
7.2.2 Example: Simple Harmonic Oscillator 
Recall the simple harmonic oscillator Hamiltonian, 
$$H = \frac{p^2}{2m} + \frac{1}{2}m\omega^2x^2. \tag{7.38}$$


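We are likely all familiar with the approach in the Schrödinger picture. In the Heisenberg picture, Eqs. (7.29) and (7.30) give
$$\frac{dx}{dt} = \frac{p}{m}, \qquad \frac{dp}{dt} = -m\omega^2x. \tag{7.39}$$
Solving these equations gives
$$x(t) = x(0)\cos(\omega t) + \frac{p(0)}{m\omega}\sin(\omega t), \qquad p(t) = -m\omega\,x(0)\sin(\omega t) + p(0)\cos(\omega t). \tag{7.40}$$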


Finding these equations in the Schrödinger picture is messy, though it can be done; in the Heisenberg picture, the result was immediate.

Keep in mind that these are operator equations. If, for example, we square the operators x(t) 
or p(t), we must be careful with the order of x(0) and p(0) in the cross terms, as these operators 
do not commute. 
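Because Eq. (7.40) writes $x(t)$ and $p(t)$ as linear combinations of $x(0)$ and $p(0)$, the equal-time commutator reduces to a determinant of the coefficients. The sketch below (function names are illustrative) checks that $[x(t), p(t)] = [x(0), p(0)]$, as must hold for unitary time evolution.

```python
import math


def sho_coefficients(m, omega, t):
    """Coefficients (a, b, c, d) of Eq. (7.40), written as
    x(t) = a x(0) + b p(0) and p(t) = c x(0) + d p(0)."""
    a = math.cos(omega * t)
    b = math.sin(omega * t) / (m * omega)
    c = -m * omega * math.sin(omega * t)
    d = math.cos(omega * t)
    return a, b, c, d


def commutator_factor(m, omega, t):
    """Return the number k in [x(t), p(t)] = k [x(0), p(0)].
    Since [x, x] = [p, p] = 0, bilinearity gives k = a d - b c."""
    a, b, c, d = sho_coefficients(m, omega, t)
    return a * d - b * c
```

The factor equals $\cos^2(\omega t) + \sin^2(\omega t) = 1$ at every time. The same bookkeeping shows that $x(t)^2$ contains the symmetrized cross term $ab\,\bigl(x(0)p(0) + p(0)x(0)\bigr)$, which is why the operator ordering matters.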




Lecture 8 (Oct. 2, 2017) 


8.1 General Time Dependent Hamiltonians 


The Schrödinger equation dictates that quantum states evolve in time according to
$$i\hbar\frac{d}{dt}|\psi(t)\rangle = H|\psi(t)\rangle. \tag{8.1}$$
In the last class, we saw that if the Hamiltonian is independent of time, $H(t) = H$, then we can solve this differential equation as
$$|\psi(t)\rangle = e^{-iHt/\hbar}|\psi(0)\rangle. \tag{8.2}$$


We are now interested in what happens if H has explicit time dependence. 
We can always expand an infinitesimal time-evolution operator as 


$$U(t+dt, t) = 1 - \frac{i}{\hbar}H(t)\,dt + O\bigl((dt)^2\bigr). \tag{8.3}$$


From the composition rule for time evolution, we then have 


$$U(t+dt, t_0) = U(t+dt, t)\,U(t, t_0) = \Bigl(1 - \frac{i}{\hbar}H(t)\,dt\Bigr)U(t, t_0). \tag{8.4}$$
We can rearrange this to give
$$U(t+dt, t_0) - U(t, t_0) = -\frac{i}{\hbar}H(t)\,dt\,U(t, t_0). \tag{8.5}$$
Dividing both sides by $dt/i\hbar$ and taking the limit $dt \to 0$, this becomes
$$i\hbar\frac{d}{dt}U(t, t_0) = H(t)\,U(t, t_0). \tag{8.6}$$
We now discuss solutions to Eq. (8.6) in several cases:
1. If $H(t) = H$, this result reduces to our previous result,
$$U(t, t_0) = e^{-iH(t-t_0)/\hbar}. \tag{8.7}$$


2. If [H(t), H(t’)] = 0 for all t,t’, then we can simultaneously diagonalize the Hamiltonian at all 
times, meaning we can choose a basis of states that are eigenstates of H(t) for all time (the 
associated eigenvalues may change as a function of time). We then have 


$$U(t, t_0) = \exp\Bigl(-\frac{i}{\hbar}\int_{t_0}^{t} dt'\,H(t')\Bigr). \tag{8.8}$$
We can check that this is the correct expression by going to the diagonal basis of H(t) and 
considering the action of U(t,to) in this basis, or by differentiating the right-hand side to see 
that it satisfies Eq. (8.6). 


3. In the most general case, $[H(t), H(t')] \neq 0$. An example is a spin-1/2 particle in a magnetic field that changes orientation as a function of time. What can we say in this case?


We know that the Hamiltonian completely defines the time evolution for an infinitesimal time 


step, and we know that we can build up finite time evolution from infinitesimal time evolution 
using the composition law. We discretize time into N steps, 


$$t_0 < t_1 < t_2 < \cdots < t_{N-1} < t_N = t, \tag{8.9}$$
with
$$\Delta t = t_{i+1} - t_i = \frac{t - t_0}{N}. \tag{8.10}$$
We could choose each time interval to have a different length, but the ultimate result will not be affected by our choice.

We take $N$ large, so that $\Delta t$ is small. Then,
$$U(t_{i+1}, t_i) = 1 - \frac{i}{\hbar}H(t_i)\,\Delta t + O\bigl((\Delta t)^2\bigr). \tag{8.11}$$
If we are ignoring terms of order $(\Delta t)^2$, then we can write
$$U(t_{i+1}, t_i) \approx e^{-iH(t_i)\Delta t/\hbar}. \tag{8.12}$$


We then build up the finite time-evolution operator as 


$$U(t, t_0) = U(t, t_{N-1})\,U(t_{N-1}, t_{N-2})\cdots U(t_{i+1}, t_i)\cdots U(t_1, t_0) = \prod_{i=0}^{N-1} U(t_{i+1}, t_i). \tag{8.13}$$


Note that the ordering of these operators is crucial, because these operators do not commute. 
We must ensure that the operators are ordered so that later times are to the left. This 
prescription for ordering of the operators is called time ordering, and the right-hand side of 
Eq. (8.13) is called a time-ordered product. 


We want to take the limit $N \to \infty$, $\Delta t \to 0$ with $N\Delta t$ held fixed. Note that every term in the product Eq. (8.13) is an exponential, and so we may be tempted to write the product as the exponential of a sum of operators. We cannot do this, strictly speaking, because the operators do not commute. However, we can invent notation, and formally write
$$U(t, t_0) = T\exp\Bigl(-\frac{i}{\hbar}\int_{t_0}^{t} dt'\,H(t')\Bigr). \tag{8.14}$$
This is called a time-ordered exponential, and is defined to be the product in Eq. (8.13) in the limit $N \to \infty$, $\Delta t \to 0$ with $N\Delta t$ held constant. The operator $T$ is called the time-ordering operator; it reorders the operators in its argument so that they are time-ordered (disregarding commutation rules when moving the operators around). What guarantees that this limit is well-defined and exists? This is guaranteed because the operator $U(t, t_0)$ is well-defined, and can be written as a composition of infinitesimal time evolution operators for any partition of the time interval $[t_0, t]$.


An alternate approach is to solve Eq. (8.6) as a formal power series. We can integrate Eq. (8.6) to reach
$$\int_{t_0}^{t} dt'\,\frac{d}{dt'}U(t', t_0) = -\frac{i}{\hbar}\int_{t_0}^{t} dt'\,H(t')\,U(t', t_0). \tag{8.15}$$
If we carry out the integration on the left-hand side, we find
$$\int_{t_0}^{t} dt'\,\frac{d}{dt'}U(t', t_0) = U(t, t_0) - U(t_0, t_0) = U(t, t_0) - 1. \tag{8.16}$$
Moving the 1 to the right-hand side of Eq. (8.15), we have
$$U(t, t_0) = 1 - \frac{i}{\hbar}\int_{t_0}^{t} dt'\,H(t')\,U(t', t_0). \tag{8.17}$$
This is an expression in terms of the Hamiltonian and time-evolution operator at times $t'$ with $t' < t$. We can similarly write
$$U(t', t_0) = 1 - \frac{i}{\hbar}\int_{t_0}^{t'} dt''\,H(t'')\,U(t'', t_0), \tag{8.18}$$


where the integrand is evaluated at values $t'' < t'$. We can carry this process out an infinite number of times and compose the results to give
$$U(t, t_0) = 1 - \frac{i}{\hbar}\int_{t_0}^{t} dt'\,H(t') + \Bigl(-\frac{i}{\hbar}\Bigr)^2\int_{t_0}^{t} dt'\int_{t_0}^{t'} dt''\,H(t')H(t'') + \cdots + \Bigl(-\frac{i}{\hbar}\Bigr)^n\int_{t_0}^{t} dt_1\int_{t_0}^{t_1} dt_2\cdots\int_{t_0}^{t_{n-1}} dt_n\,H(t_1)\cdots H(t_n) + \cdots. \tag{8.19}$$
Note that the operators in each integrand are time-ordered. This observation allows us to write Eq. (8.19) in such a way that the limits of integration are not so complicated, using
$$\int_{t_0}^{t} dt'\int_{t_0}^{t'} dt''\,H(t')H(t'') = \frac{1}{2}\int_{t_0}^{t} dt'\int_{t_0}^{t} dt''\,T\bigl[H(t')H(t'')\bigr]. \tag{8.20}$$
The factor of $\frac{1}{2}$ here deals with overcounting. If we rewrite each term in Eq. (8.19) in a similar way, there will be a factor of $\frac{1}{n!}$ on the $n$th term to deal with overcounting. We then have


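$$U(t, t_0) = 1 - \frac{i}{\hbar}\int_{t_0}^{t} dt'\,H(t') + \cdots + \frac{1}{n!}\Bigl(-\frac{i}{\hbar}\Bigr)^n\int_{t_0}^{t} dt_1\int_{t_0}^{t} dt_2\cdots\int_{t_0}^{t} dt_n\,T\bigl[H(t_1)\cdots H(t_n)\bigr] + \cdots. \tag{8.21}$$
This series is known as the Dyson series. This looks like the series expansion of an exponential, with each term time-ordered. Thus, we have reproduced Eq. (8.14) with a different approach,
$$U(t, t_0) = T\exp\Bigl(-\frac{i}{\hbar}\int_{t_0}^{t} dt'\,H(t')\Bigr). \tag{8.22}$$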
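The importance of the ordering in the product of Eq. (8.13) can be seen in a small numerical sketch. It assumes a spin-1/2 Hamiltonian of the form $H = (\hbar/2)\,\mathbf{h}\cdot\boldsymbol{\sigma}$ that is piecewise constant in time; the helper names and the two-step field schedule are illustrative choices, not part of the lecture.

```python
import math


def matmul(a, b):
    """Product of two 2x2 complex matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def step_propagator(h, dt):
    """exp(-i H dt / hbar) for H = (hbar/2) h . sigma, with h = (hx, hy, hz)."""
    hx, hy, hz = h
    norm = math.sqrt(hx * hx + hy * hy + hz * hz)
    if norm == 0.0:
        return [[1.0 + 0j, 0j], [0j, 1.0 + 0j]]
    c = math.cos(norm * dt / 2)
    s = math.sin(norm * dt / 2)
    nx, ny, nz = hx / norm, hy / norm, hz / norm
    # exp(-i theta n.sigma / 2) = cos(theta/2) - i sin(theta/2) n.sigma
    return [[c - 1j * s * nz, (-1j * nx - ny) * s],
            [(-1j * nx + ny) * s, c + 1j * s * nz]]


def time_ordered_product(steps):
    """Multiply step propagators with later times to the LEFT, as in Eq. (8.13).
    `steps` is a list of (h, dt) pairs in chronological order."""
    u = [[1.0 + 0j, 0j], [0j, 1.0 + 0j]]
    for h, dt in steps:
        u = matmul(step_propagator(h, dt), u)
    return u


def max_abs_diff(a, b):
    """Largest entrywise difference between two 2x2 matrices."""
    return max(abs(a[i][j] - b[i][j]) for i in range(2) for j in range(2))


# Field along x for a quarter turn, then along z for a quarter turn.
steps = [((1.0, 0.0, 0.0), math.pi / 2), ((0.0, 0.0, 1.0), math.pi / 2)]
u_ordered = time_ordered_product(steps)
u_reversed = time_ordered_product(list(reversed(steps)))
print(max_abs_diff(u_ordered, u_reversed))
```

For this schedule the two orderings differ by $-i\sigma^y$, so the printed difference is of order one rather than a small correction. When the field direction never changes, the step propagators commute and both orderings agree, which is case 2 above.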


8.2 Interaction Picture 


We have seen the Schrödinger picture and the Heisenberg picture; now we will discuss a third picture, due to Dirac, called the interaction picture. This is a mixed picture that is useful when we can write the Hamiltonian in the form
$$H(t) = H_0(t) + V(t), \tag{8.23}$$


where we have a very good understanding of the dynamics under the Hamiltonian $H_0(t)$, and we think of $V(t)$ as a small perturbation to the system described by $H_0(t)$.
In the interaction picture, we remove the evolution due to $H_0$ from the state by writing
$$|\psi_I(t)\rangle = U_0^{-1}(t)\,|\psi_S(t)\rangle, \tag{8.24}$$
with $U_0(t)$ the time-evolution operator generated by $H_0$. Contrast this with the expression in the Heisenberg picture, where the states were defined as
$$|\psi_H\rangle = U^{-1}(t)\,|\psi_S(t)\rangle. \tag{8.25}$$
The state $|\psi_I(t)\rangle$ evolves in time according to
$$i\hbar\frac{d}{dt}|\psi_I(t)\rangle = V_I(t)\,|\psi_I(t)\rangle, \tag{8.26}$$
where
$$V_I(t) = U_0^{-1}(t)\,V(t)\,U_0(t). \tag{8.27}$$
Operators in the interaction picture evolve according to
$$A_I(t) = U_0^{-1}(t)\,A\,U_0(t). \tag{8.28}$$


The interaction picture is useful when we fully understand the dynamics of $H_0$, and $V$ is a weak probe. We can then use this approach to determine how quickly our system absorbs energy from the weak probe. In the interaction picture, we are hiding the time evolution due to the system we understand, so that we only see the time evolution coming from the unknown part of the system.

We can now derive the equation of motion for the time-evolution operator in the interaction picture. We define
$$U(t) = U_0(t)\,U_I(t), \tag{8.29}$$
where $U_I(t)$ captures the time evolution due to the perturbation. We know that the time-evolution operator must satisfy
$$i\hbar\frac{d}{dt}U(t) = (H_0 + V)\,U(t). \tag{8.30}$$
By definition, $U_0(t)$ is the time-evolution operator in the system with Hamiltonian $H_0(t)$, so it satisfies
$$i\hbar\frac{d}{dt}U_0(t) = H_0\,U_0(t). \tag{8.31}$$
We compute
$$i\hbar\frac{d}{dt}U(t) = i\hbar\frac{d}{dt}(U_0U_I) = i\hbar\frac{dU_0}{dt}U_I + i\hbar\,U_0\frac{dU_I}{dt}. \tag{8.32}$$
Using Eq. (8.31) on the first term of the right-hand side yields
$$i\hbar\frac{d}{dt}U(t) = H_0U_0U_I + i\hbar\,U_0\frac{dU_I}{dt} = H_0U + i\hbar\,U_0\frac{dU_I}{dt}. \tag{8.33}$$
On the other hand, from Eq. (8.30), we have
$$i\hbar\frac{d}{dt}U(t) = H_0U + VU. \tag{8.34}$$
Thus,
$$i\hbar\,U_0\frac{dU_I}{dt} = VU = VU_0U_I, \tag{8.35}$$
which gives us
$$i\hbar\frac{dU_I}{dt} = U_0^{-1}VU_0\,U_I = V_I(t)\,U_I(t). \tag{8.36}$$




Lecture 9 (Oct. 4, 2017) 


9.1 Spin-1/2 in an AC Field


Consider a spin-1/2 system in a time-dependent magnetic field. The Hamiltonian is
$$H = -\frac{ge}{2m}\,\mathbf{S}\cdot\mathbf{B}(t). \tag{9.1}$$
We will consider a particular class of time-dependent magnetic fields, which we can write in the form
$$\mathbf{B}(t) = B_0\,\hat{z} + B_1\bigl(\cos(\omega t)\,\hat{x} + \sin(\omega t)\,\hat{y}\bigr). \tag{9.2}$$
We have previously studied the case where $B_1 = 0$.
In the case $B_1 = 0$, there are two energy eigenstates (the spin-up and spin-down states along the z-axis) that have an energy splitting of
$$\Delta E = \Bigl|\frac{geB_0\hbar}{2m}\Bigr|. \tag{9.3}$$


Now that we are considering a field with nonzero $B_1$, we expect the system to be able to absorb energy from the oscillating component of the field to transition between the two energy eigenstates of the static field.

We will analyze this problem in the interaction picture. A seemingly sensible choice is to regard the $B_0$ term of the Hamiltonian to be $H_0$, the part of the Hamiltonian for which the resulting time evolution is already understood. We write
$$H = H_0 + V(t), \tag{9.4}$$
where
$$H_0 = -\frac{ge}{2m}S^zB_0, \qquad V(t) = -\frac{ge}{2m}\,\mathbf{S}_\perp\cdot\mathbf{B}_\perp(t). \tag{9.5}$$
Here, we have defined
$$\mathbf{S}_\perp := S^x\hat{x} + S^y\hat{y}, \qquad \mathbf{B}_\perp := B_1\bigl(\cos(\omega t)\,\hat{x} + \sin(\omega t)\,\hat{y}\bigr). \tag{9.6}$$
We define
$$\omega_0 := -\frac{geB_0}{2m}, \tag{9.7}$$
so that we have
$$H_0 = \omega_0S^z. \tag{9.8}$$
(Note that we are taking $e < 0$ so that the signs work out here.) The time-evolution operator due to this part of the Hamiltonian is
$$U_0(t) = e^{-iH_0t/\hbar}. \tag{9.9}$$
The equation of motion in the interaction picture is
$$i\hbar\frac{d}{dt}|\psi_I(t)\rangle = V_I(t)\,|\psi_I(t)\rangle, \tag{9.10}$$


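where
$$V_I(t) = U_0^{-1}(t)\,V(t)\,U_0(t) = -\frac{ge}{2m}\,e^{i\omega_0S^zt/\hbar}\,\mathbf{S}_\perp\cdot\mathbf{B}_\perp(t)\,e^{-i\omega_0S^zt/\hbar} = -\frac{geB_1}{2m}\,e^{i\omega_0S^zt/\hbar}\bigl(S^x\cos(\omega t) + S^y\sin(\omega t)\bigr)e^{-i\omega_0S^zt/\hbar}. \tag{9.11}$$
Before evaluating this directly, it is convenient to define
$$S^\pm := S^x \pm iS^y. \tag{9.12}$$
These are called the ladder operators. Note that they are Hermitian conjugates of one another. We can now compute
$$e^{i\phi\sigma^z/2}\,S^+\,e^{-i\phi\sigma^z/2} = \Bigl(\cos\frac{\phi}{2} + i\sigma^z\sin\frac{\phi}{2}\Bigr)S^+\Bigl(\cos\frac{\phi}{2} - i\sigma^z\sin\frac{\phi}{2}\Bigr) = S^+\Bigl(\cos\frac{\phi}{2} - i\sigma^z\sin\frac{\phi}{2}\Bigr)^2 = S^+\bigl(\cos\phi - i\sigma^z\sin\phi\bigr) = e^{i\phi}S^+. \tag{9.13}$$
Here, in the first line we have expanded the exponentials, using the fact that the Pauli matrices square to the identity. In the second line, we have used the fact that $\sigma^z$ anticommutes with both $\sigma^x$ and $\sigma^y$. In the last step, we have used $S^+\sigma^z = -S^+$. Note that
$$\sigma^x + i\sigma^y = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix} + i\begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix} = \begin{pmatrix} 0 & 2 \\ 0 & 0 \end{pmatrix}. \tag{9.14}$$
This is why these operators are called ladder operators ($S^+$ specifically is the raising operator). Taking the Hermitian conjugate of Eq. (9.13), we have
$$e^{i\phi\sigma^z/2}\,S^-\,e^{-i\phi\sigma^z/2} = e^{-i\phi}S^-. \tag{9.15}$$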
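Now we can return to the calculation of $V_I(t)$. Using $S^x\cos(\omega t) + S^y\sin(\omega t) = \frac{1}{2}\bigl(S^+e^{-i\omega t} + S^-e^{i\omega t}\bigr)$ together with Eqs. (9.13) and (9.15) for $\phi = \omega_0t$, we have
$$V_I(t) = -\frac{geB_1}{2m}\,e^{i\omega_0S^zt/\hbar}\,\frac{S^+e^{-i\omega t} + S^-e^{i\omega t}}{2}\,e^{-i\omega_0S^zt/\hbar} = -\frac{geB_1}{4m}\Bigl(S^+e^{i(\omega_0-\omega)t} + S^-e^{-i(\omega_0-\omega)t}\Bigr). \tag{9.16}$$
9.1.1 Resonant Drive
First, let's consider the simple case where the external frequency matches the precession frequency, $\omega = \omega_0$. This case is called "resonant drive." In this case, the potential in Eq. (9.16) simplifies to
$$V_I = -\frac{geB_1}{4m}\bigl(S^+ + S^-\bigr) = -\frac{geB_1}{2m}S^x, \tag{9.17}$$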
which is time-independent. The interaction picture time-evolution operator is then
$$U_I(t) = e^{i\frac{geB_1}{2m\hbar}S^xt}. \tag{9.18}$$
This operator rotates states about the x-axis of spin space by an angle of magnitude $|geB_1t/2m|$. The frequency of rotation,
$$\omega_R = \frac{geB_1}{2m}, \tag{9.19}$$
is called the Rabi frequency. Note that the time-evolution operator,
$$U_I(t) = e^{i\omega_RS^xt/\hbar} = e^{i\left(\frac{\omega_Rt}{2}\right)\sigma^x}, \tag{9.20}$$
does not oscillate with a frequency of $\omega_R$, but rather $\omega_R/2$:
$$U_I\Bigl(t + \frac{4\pi}{\omega_R}\Bigr) = U_I(t). \tag{9.21}$$
This operator has periodicity twice that of any observables in the system. What happens after a time $2\pi/\omega_R$? At this time, we have
$$U_I\Bigl(\frac{2\pi}{\omega_R}\Bigr) = e^{i\pi\sigma^x} = -1. \tag{9.22}$$
WR 
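Thus, any observables will oscillate at the Rabi frequency, but if we were able to measure the phase of a state, we would find that it oscillates at half the Rabi frequency.
Imagine that we start in a spin-up state, i.e.,
$$|\psi_I(0)\rangle \to \begin{pmatrix} 1 \\ 0 \end{pmatrix}, \tag{9.23}$$
where the arrow indicates that we are representing the state as a vector. At time $t$, we then have
$$|\psi_I(t)\rangle \to e^{i\omega_Rt\sigma^x/2}\begin{pmatrix} 1 \\ 0 \end{pmatrix} = \begin{pmatrix} \cos\bigl(\frac{\omega_Rt}{2}\bigr) \\ i\sin\bigl(\frac{\omega_Rt}{2}\bigr) \end{pmatrix}. \tag{9.24}$$
Thus, after a time $t = \pi/\omega_R$ we have
$$|\psi_I(\pi/\omega_R)\rangle \to \begin{pmatrix} 0 \\ i \end{pmatrix}, \tag{9.25}$$
and after a time $t = 2\pi/\omega_R$ we have
$$|\psi_I(2\pi/\omega_R)\rangle \to \begin{pmatrix} -1 \\ 0 \end{pmatrix}, \tag{9.26}$$
where we see that the state has picked up a phase of $\pi$. We find that the state oscillates; this behavior is referred to as Rabi oscillation.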
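The difference between the period of the state and the period of observables can be tabulated directly from Eq. (9.24). This is a minimal sketch; the function names and the value chosen for the Rabi frequency are illustrative.

```python
import math


def interaction_state(omega_r, t):
    """Interaction-picture state of Eq. (9.24), starting from spin up:
    (cos(omega_r t / 2), i sin(omega_r t / 2))."""
    return (complex(math.cos(omega_r * t / 2)),
            1j * math.sin(omega_r * t / 2))


def prob_up(omega_r, t):
    """Probability of finding the spin up along z at time t."""
    up, _ = interaction_state(omega_r, t)
    return abs(up) ** 2


omega_r = 0.8  # hypothetical Rabi frequency, arbitrary units
for label, t in [("pi/omega_R", math.pi / omega_r),
                 ("2 pi/omega_R", 2 * math.pi / omega_r),
                 ("4 pi/omega_R", 4 * math.pi / omega_r)]:
    print(label, interaction_state(omega_r, t), prob_up(omega_r, t))
```

The spin-up probability returns to 1 after $2\pi/\omega_R$, while the state vector itself only returns to its starting value after $4\pi/\omega_R$, in line with Eqs. (9.21) and (9.22).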


9.1.2 Off-Resonant Drive 


Up until now, we have been working in the case of resonant drive. What happens if we're off resonance, i.e., $\omega \neq \omega_0$? In this case, even though we went to the interaction picture, we still have a time-dependent Hamiltonian that we don't know how to solve, because $V_I(t)$ is still time-dependent. It seems like we have gained nothing.

The issue is that we made a particular choice for which part of the Hamiltonian $H$ would be $H_0$, but this was not actually the most convenient choice. Let's instead choose
$$H = H_0 + V(t), \tag{9.27}$$
where
$$H_0 = \omega S^z, \qquad V(t) = -\gamma\bigl(B_{\mathrm{eff}}S^z + \mathbf{B}_\perp(t)\cdot\mathbf{S}_\perp\bigr). \tag{9.28}$$
Here, we have defined
$$\gamma := \frac{ge}{2m}, \qquad B_{\mathrm{eff}} := B_0 + \frac{\omega}{\gamma}, \tag{9.29}$$
so that $H_0 + V(t)$ reproduces Eq. (9.1).


Now, let's work in the interaction picture with these choices. Going to the interaction picture is going to a rotating frame that rotates with $H_0$. Previously, we went to a frame that rotated about the z-axis at frequency $\omega_0$; now we are going to a frame that rotates about the z-axis at the external frequency $\omega$.

We compute
$$V_I(t) = U_0^\dagger(t)\,V(t)\,U_0(t) = e^{i\omega S^zt/\hbar}\Bigl[-\gamma B_{\mathrm{eff}}S^z - \frac{\gamma B_1}{2}\bigl(S^+e^{-i\omega t} + S^-e^{i\omega t}\bigr)\Bigr]e^{-i\omega S^zt/\hbar}. \tag{9.30}$$
The first term in brackets is unaffected by the time evolution, because $S^z$ commutes with the exponentials outside the brackets. The second term in brackets, along with the time-evolution exponentials, looks exactly like the calculation we did in Eq. (9.16), but with the frequencies in all of the exponentials exactly matched. Thus, we find
$$V_I(t) = -\gamma B_{\mathrm{eff}}S^z - \gamma B_1S^x. \tag{9.31}$$
This is time-independent, and so we now declare the problem solved. The spin precesses about the direction of
$$\mathbf{B} = (B_1, 0, B_{\mathrm{eff}}). \tag{9.32}$$
These are modified Rabi oscillations, with a frequency of
$$\omega_R = \gamma\sqrt{B_{\mathrm{eff}}^2 + B_1^2}. \tag{9.33}$$
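The rotating-frame result can be compared against a direct step-by-step integration in the lab frame. The sketch below assumes $\hbar = 1$ and parametrizes the Hamiltonian by frequencies, $H(t) = \frac{1}{2}\bigl[\omega_0\sigma^z + \omega_1(\cos\omega t\,\sigma^x + \sin\omega t\,\sigma^y)\bigr]$, where `omega0` and `omega1` stand for $-\gamma B_0$ and $-\gamma B_1$. Sampling the field at the midpoint of each step is a choice made here for accuracy, not something taken from the lecture, and all names are illustrative.

```python
import math


def apply_step(h, dt, psi):
    """Apply exp(-i H dt) with H = (1/2) h . sigma (hbar = 1) to a two-component state."""
    hx, hy, hz = h
    norm = math.sqrt(hx * hx + hy * hy + hz * hz)
    if norm == 0.0:
        return psi
    c, s = math.cos(norm * dt / 2), math.sin(norm * dt / 2)
    nx, ny, nz = hx / norm, hy / norm, hz / norm
    up, down = psi
    new_up = (c - 1j * s * nz) * up + (-1j * nx - ny) * s * down
    new_down = (-1j * nx + ny) * s * up + (c + 1j * s * nz) * down
    return (new_up, new_down)


def flip_probability_numeric(omega0, omega1, omega, t, n_steps=4000):
    """Spin-flip probability from a product of short-time propagators in the lab frame."""
    dt = t / n_steps
    psi = (1.0 + 0j, 0j)  # start spin up
    for i in range(n_steps):
        ti = (i + 0.5) * dt  # midpoint of the i-th step
        h = (omega1 * math.cos(omega * ti), omega1 * math.sin(omega * ti), omega0)
        psi = apply_step(h, dt, psi)
    return abs(psi[1]) ** 2


def flip_probability_rabi(omega0, omega1, omega, t):
    """Closed form from the time-independent V_I of Eq. (9.31): precession about
    (omega1, 0, omega0 - omega) at the frequency given by Eq. (9.33)."""
    detuning = omega0 - omega
    big_omega = math.sqrt(detuning ** 2 + omega1 ** 2)
    return (omega1 ** 2 / big_omega ** 2) * math.sin(big_omega * t / 2) ** 2
```

For instance, `flip_probability_numeric(1.0, 0.3, 0.8, 5.0)` and `flip_probability_rabi(1.0, 0.3, 0.8, 5.0)` should agree closely. Off resonance the flip probability never reaches 1, because the effective field is tilted away from the x-axis.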


9.2. Path Integral Formulation of Quantum Mechanics 


Time evolution in the Schrödinger picture is given by
$$|\psi(t)\rangle = \sum_{a'} c_{a'}(t)\,|a'\rangle, \tag{9.34}$$
where the $|a'\rangle$ form an energy eigenbasis, and
$$c_{a'}(t) = e^{-iE_{a'}(t-t_0)/\hbar}\,c_{a'}(t_0). \tag{9.35}$$
We can consider these particles in position space, and define
$$u_{a'}(x) := \langle x|a'\rangle \tag{9.36}$$
to be the energy eigenfunctions. Then the position-space wavefunction is given by
$$\psi(x, t) = \sum_{a'} e^{-iE_{a'}(t-t_0)/\hbar}\,c_{a'}(t_0)\,u_{a'}(x). \tag{9.37}$$
We can rewrite this as
$$\psi(x, t) = \int_{-\infty}^{\infty} dx'\,K(x, t; x', t_0)\,\psi(x', t_0), \tag{9.38}$$
with
$$K(x, t; x', t_0) = \sum_{a'}\langle x|a'\rangle\,e^{-iE_{a'}(t-t_0)/\hbar}\,\langle a'|x'\rangle = \langle x|U(t, t_0)|x'\rangle. \tag{9.39}$$
The object $K(x, t; x', t_0)$ is called the propagator.




Lecture 10 (Oct. 11, 2017) 


10.1 Path Integral Formulation of Quantum Mechanics 
10.1.1 The Propagator 


In the last lecture, we introduced the propagator,
$$K(x, t; x', t_0) = \sum_{a'}\langle x|a'\rangle\,e^{-iE_{a'}(t-t_0)/\hbar}\,\langle a'|x'\rangle = \langle x|U(t, t_0)|x'\rangle. \tag{10.1}$$
In terms of the propagator, we can write the wavefunction in the form
$$\psi(x, t) = \int_{-\infty}^{\infty} d^dx'\,K(x, t; x', t_0)\,\psi(x', t_0). \tag{10.2}$$
From the Schrödinger equation, assuming a Hamiltonian of the form
$$H = \frac{p^2}{2m} + V(x), \tag{10.3}$$
the propagator must satisfy the differential equation
$$\Bigl(i\hbar\,\partial_t + \frac{\hbar^2\nabla^2}{2m} - V(x)\Bigr)K(x, t; x', t_0) = 0. \tag{10.4}$$
Furthermore, it must satisfy
$$\lim_{t\to t_0} K(x, t; x', t_0) = \delta^{(d)}(x - x'). \tag{10.5}$$
A convenient related quantity is the retarded propagator
$$K_{\mathrm{ret}}(x, t; x', t_0) = \theta(t - t_0)\,K(x, t; x', t_0). \tag{10.6}$$
The retarded propagator satisfies
$$\Bigl(i\hbar\,\partial_t + \frac{\hbar^2\nabla^2}{2m} - V(x)\Bigr)K_{\mathrm{ret}}(x, t; x', t') = i\hbar\,\delta^{(d)}(x - x')\,\delta(t - t'), \tag{10.7}$$
i.e., it is a Green's function for the Schrödinger equation (a solution of the Schrödinger equation with a delta function source or forcing term).
Alternatively, we can think of the propagator as
$$K(x, t; x', t_0) = \langle x, t|x', t_0\rangle, \tag{10.8}$$
which follows from its definition.


As an example, consider the free particle in one dimension, with Hamiltonian
$$H = \frac{p^2}{2m}. \tag{10.9}$$
The propagator in this case is given by
$$K(x, t; x', t') = \sqrt{\frac{m}{2\pi i\hbar(t - t')}}\,\exp\Bigl(\frac{im(x - x')^2}{2\hbar(t - t')}\Bigr). \tag{10.10}$$
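As a quick consistency check, the free propagator of Eq. (10.10) should satisfy the free Schrödinger equation, Eq. (10.4) with $V = 0$. The sketch below estimates the residual with central finite differences, assuming one dimension, $x' = 0$, $t' = 0$, and units with $m = \hbar = 1$ by default; the function names and step size are illustrative.

```python
import cmath
import math


def free_propagator(x, t, m=1.0, hbar=1.0):
    """Eq. (10.10) with x' = 0 and t' = 0."""
    prefactor = cmath.sqrt(m / (2j * math.pi * hbar * t))
    return prefactor * cmath.exp(1j * m * x ** 2 / (2 * hbar * t))


def schrodinger_residual(x, t, m=1.0, hbar=1.0, h=1e-4):
    """Finite-difference estimate of |i hbar dK/dt + (hbar^2 / 2m) d^2K/dx^2|."""
    dk_dt = (free_propagator(x, t + h, m, hbar)
             - free_propagator(x, t - h, m, hbar)) / (2 * h)
    d2k_dx2 = (free_propagator(x + h, t, m, hbar)
               - 2 * free_propagator(x, t, m, hbar)
               + free_propagator(x - h, t, m, hbar)) / h ** 2
    return abs(1j * hbar * dk_dt + hbar ** 2 / (2 * m) * d2k_dx2)


print(schrodinger_residual(0.7, 1.3))
```

The printed residual should be small compared with $|K|$ itself, which is of order $0.3$ at this point; it is limited only by the finite-difference step.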




The propagator has a number of properties that make it useful to calculate. The first is related to quantum statistical mechanics. We define the partition function as
$$G(t) = \int d^dx\,K(x, t; x, 0) = \sum_{a'} e^{-iE_{a'}t/\hbar}. \tag{10.11}$$
If we analytically continue to imaginary time, $t = -i\hbar\beta$, then this yields the familiar partition function of statistical mechanics:
$$G(-i\hbar\beta) = \sum_{a'} e^{-\beta E_{a'}} = Z. \tag{10.12}$$
Another useful quantity is the Fourier transform of the partition function,
$$\tilde{G}(E) = -i\int_{-\infty}^{\infty} dt\,G_{\mathrm{ret}}(t)\,e^{iEt/\hbar} = -i\sum_{a'}\int_0^{\infty} dt\,e^{i(E - E_{a'})t/\hbar}, \tag{10.13}$$
where $G_{\mathrm{ret}}(t) = \theta(t)\,G(t)$. In order to make this integral converge, we take $E \to E + i\epsilon$, which then yields
$$\tilde{G}(E) = \sum_{a'}\frac{\hbar}{E - E_{a'} + i\epsilon}. \tag{10.14}$$
The poles of $\tilde{G}$ in the limit $\epsilon \to 0$ describe the energy spectrum of the system.
We can also use the Fourier transform of the partition function to find the density of states. The density of states is defined as
$$\rho(E) = \sum_{a'}\delta(E - E_{a'}). \tag{10.15}$$
Using the property
$$\lim_{\epsilon\to 0}\,\mathrm{Im}\,\frac{1}{E - E' + i\epsilon} = \lim_{\epsilon\to 0}\frac{-\epsilon}{(E - E')^2 + \epsilon^2} = -\pi\,\delta(E - E'), \tag{10.16}$$
we see that we can write the density of states as
$$\rho(E) = -\frac{1}{\pi\hbar}\,\mathrm{Im}\,\tilde{G}(E). \tag{10.17}$$


10.1.2 Path Integrals 
From the composition property of time evolution,
$$U(t, t_0) = U(t, \tilde{t})\,U(\tilde{t}, t_0), \qquad t \geq \tilde{t} \geq t_0, \tag{10.18}$$
we see that the propagator satisfies a similar composition property,
$$K(x, t; x', t_0) = \int d^d\tilde{x}\,K(x, t; \tilde{x}, \tilde{t})\,K(\tilde{x}, \tilde{t}; x', t_0). \tag{10.19}$$
We can break the interval $[t_0, t]$ into $N$ equal time intervals of width
$$\Delta t = \frac{t - t_0}{N}, \tag{10.20}$$
with endpoints
$$t_0 < t_1 < \cdots < t_{N-1} < t_N = t. \tag{10.21}$$
Iterating the composition rule then gives us
$$K(x_N, t_N; x_0, t_0) = \int\prod_{k=1}^{N-1} dx_k\,K(x_N, t_N; x_{N-1}, t_{N-1})\cdots K(x_2, t_2; x_1, t_1)\,K(x_1, t_1; x_0, t_0). \tag{10.22}$$
Feynman proposed that
$$K(x, t; x', t_0) = \int[\mathcal{D}x]\,e^{iS[x(t)]/\hbar}, \tag{10.23}$$

where the right-hand side is a sum over all possible paths from $(x', t_0)$ to $(x, t)$ (including those that do not satisfy the equations of motion), and $S$ is the classical action of the trajectory. Recall that in classical mechanics, the Lagrangian is given by
$$L = \frac{1}{2}m\dot{x}^2 - V(x), \tag{10.24}$$
and the action is given by $S = \int dt\,L$. (Here, we have assumed that the kinetic energy is of the form $m\dot{x}^2/2$.) The classical trajectory between two spacetime points is obtained by extremizing this action, $\delta S = 0$. This is known as the principle of least action. This requirement leads to the Euler-Lagrange equations
$$\frac{d}{dt}\Bigl(\frac{\partial L}{\partial\dot{x}}\Bigr) - \frac{\partial L}{\partial x} = 0. \tag{10.25}$$
For the Lagrangian given above, this gives Newton's force law,
$$m\frac{d^2x}{dt^2} = -\frac{\partial V}{\partial x}. \tag{10.26}$$
Note that Feynman's proposed expression for the propagator,
$$K(x, t; x', t_0) = \int[\mathcal{D}x]\,e^{iS[x(t)]/\hbar}, \tag{10.27}$$

clearly satisfies the composition property. It also gives a very simple connection to classical mechanics: in the $\hbar \to 0$ limit, we expect the sum over paths is dominated by the path for which the phase is stationary (this is the stationary phase approximation, which we will discuss more thoroughly later), i.e., the path for which $S$ is extremized. Thus, Feynman's path integral expression yields the principle of least action in the classical limit.




Lecture 11 (Oct. 16, 2017) 


11.1 Path Integrals 


As we have seen, the amplitude for a state to "propagate" from coordinates $(x, t)$ to $(x', t')$ is given by the propagator,
$$K(x', t'; x, t) = \int[\mathcal{D}x]\,e^{iS[x(t)]/\hbar}, \tag{11.1}$$
where $S[x(t)]$ is the classical action of the path $x(t)$,
$$S[x(t)] = \int dt\,L(x, \dot{x}), \tag{11.2}$$

for L the classical Lagrangian. The integration in the path integral is over all possible paths x(t). 

We need to define this precisely. In particular, we need to write a measure on the space of paths 
in order to know how to properly integrate over all paths. The strategy will be to start with the 
definition of K, and then derive the path integral and the appropriate definition of the measure. 
Then we will go back and recompute K for a free particle. 

The starting point of deriving the path integral is to use the composition law for $K$, which comes from the composition law for the time-evolution operator. Recall that the propagator $K$ is the matrix element
$$K(x', t'; x, t) = \langle x'|e^{-iH(t'-t)/\hbar}|x\rangle. \tag{11.3}$$
From the composition law, we have
$$K(x_N, t_N; x_0, t_0) = \int\prod_{k=1}^{N-1} dx_k\,K(x_N, t_N; x_{N-1}, t_{N-1})\cdots K(x_1, t_1; x_0, t_0)$$
$$= \int\prod_{k=1}^{N-1} dx_k\,\langle x_N|U(t_N, t_{N-1})|x_{N-1}\rangle\cdots\langle x_1|U(t_1, t_0)|x_0\rangle$$
$$= \int\prod_{k=1}^{N-1} dx_k\,\langle x_N|e^{-i\epsilon H/\hbar}|x_{N-1}\rangle\cdots\langle x_1|e^{-i\epsilon H/\hbar}|x_0\rangle, \tag{11.4}$$
where $\epsilon = (t_N - t_0)/N$. This expression holds in general for time-dependent Hamiltonians, if we understand each occurrence of $H$ in the expression above to be a function of the time.
We now consider a single matrix element in this expression,
$$\langle x_{j+1}|e^{-i\epsilon H/\hbar}|x_j\rangle = \langle x_{j+1}|e^{-i\epsilon(T+V)/\hbar}|x_j\rangle, \tag{11.5}$$
where
$$T = \frac{p^2}{2m}, \qquad V = V(x). \tag{11.6}$$
Note that
$$e^{-i\epsilon(T+V)/\hbar} = 1 - \frac{i\epsilon(T+V)}{\hbar} - \frac{\epsilon^2(T+V)^2}{2\hbar^2} + \cdots = e^{-i\epsilon T/\hbar}\,e^{-i\epsilon V/\hbar} + O(\epsilon^2). \tag{11.7}$$
Thus, if we take $\epsilon \to 0$, $N \to \infty$ with $N\epsilon$ fixed, then we can use
$$e^{-i\epsilon(T+V)/\hbar} \to e^{-i\epsilon T/\hbar}\,e^{-i\epsilon V/\hbar}. \tag{11.8}$$


We can only make this replacement legally in this limit, as these two expressions differ at higher 
orders. 
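The $O(\epsilon^2)$ size of the splitting error in Eq. (11.7) can be illustrated with matrices. In the sketch below, $T$ and $V$ are replaced by the 2x2 stand-ins $T = a\sigma^x$ and $V = b\sigma^z$ (with $\hbar = 1$); these are an assumption made purely so the exponentials have closed forms, and they are not the kinetic and potential operators of the lecture.

```python
import math


def exp_pauli(ax, az, eps):
    """exp(-i eps (ax sigma_x + az sigma_z)) as a 2x2 nested list."""
    norm = math.hypot(ax, az)
    if norm == 0.0:
        return [[1.0 + 0j, 0j], [0j, 1.0 + 0j]]
    c, s = math.cos(eps * norm), math.sin(eps * norm)
    nx, nz = ax / norm, az / norm
    # exp(-i theta n.sigma) = cos(theta) - i sin(theta) n.sigma
    return [[c - 1j * s * nz, -1j * s * nx],
            [-1j * s * nx, c + 1j * s * nz]]


def matmul(a, b):
    """Product of two 2x2 complex matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def splitting_error(a, b, eps):
    """Largest entry of exp(-i eps (T+V)) - exp(-i eps T) exp(-i eps V)
    for the stand-ins T = a sigma_x and V = b sigma_z."""
    exact = exp_pauli(a, b, eps)
    split = matmul(exp_pauli(a, 0.0, eps), exp_pauli(0.0, b, eps))
    return max(abs(exact[i][j] - split[i][j])
               for i in range(2) for j in range(2))


ratio = splitting_error(0.8, 1.1, 0.02) / splitting_error(0.8, 1.1, 0.01)
print(ratio)
```

Halving $\epsilon$ reduces the error by a factor close to four, as expected for an $O(\epsilon^2)$ term. Over $N \propto 1/\epsilon$ steps the accumulated error is then $O(\epsilon)$, which vanishes in the limit.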
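We now choose $\epsilon$ small so that we can safely omit the $O(\epsilon^2)$ terms above. We then have
$$\langle x_{j+1}|e^{-i\epsilon H/\hbar}|x_j\rangle \approx \langle x_{j+1}|e^{-i\epsilon T/\hbar}\,e^{-i\epsilon V/\hbar}|x_j\rangle = \langle x_{j+1}|e^{-i\epsilon T/\hbar}|x_j\rangle\,e^{-i\epsilon V(x_j)/\hbar}. \tag{11.9}$$
We see then that we only have to evaluate the matrix element for a free particle. We can evaluate this by inserting the resolution of the identity in the momentum basis, which results in taking a Fourier transform:
$$\langle x_{j+1}|e^{-i\epsilon p^2/2m\hbar}|x_j\rangle = \int dp_j\,\langle x_{j+1}|p_j\rangle\langle p_j|e^{-i\epsilon p^2/2m\hbar}|x_j\rangle = \int\frac{dp_j}{2\pi\hbar}\,e^{ip_j(x_{j+1}-x_j)/\hbar}\,e^{-i\epsilon p_j^2/2m\hbar} = \sqrt{\frac{m}{2\pi i\hbar\epsilon}}\,e^{\frac{im(x_{j+1}-x_j)^2}{2\hbar\epsilon}}. \tag{11.10}$$
This is the same result we found in the last lecture.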
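Thus, we have
$$\langle x_{j+1}|e^{-i\epsilon H/\hbar}|x_j\rangle \approx \sqrt{\frac{m}{2\pi i\hbar\epsilon}}\,\exp\Bigl\{\frac{i}{\hbar}\Bigl[\frac{m(x_{j+1}-x_j)^2}{2\epsilon} - \epsilon V(x_j)\Bigr]\Bigr\}. \tag{11.11}$$
We then have
$$K(x_N, t_N; x_0, t_0) = \lim_{\epsilon\to 0}\Bigl(\frac{m}{2\pi i\hbar\epsilon}\Bigr)^{N/2}\int\prod_{k=1}^{N-1} dx_k\,\exp\Bigl\{\frac{i}{\hbar}\sum_{j=0}^{N-1}\Bigl[\frac{m(x_{j+1}-x_j)^2}{2\epsilon} - \epsilon V(x_j)\Bigr]\Bigr\}. \tag{11.12}$$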


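In the limit $\epsilon \to 0$, we have
$$x_{j+1} - x_j \approx \epsilon\,\frac{dx}{dt}, \tag{11.13}$$
which gives us
$$\frac{(x_{j+1} - x_j)^2}{\epsilon} \approx \epsilon\Bigl(\frac{dx}{dt}\Bigr)^2. \tag{11.14}$$
Thus,
$$\frac{i}{\hbar}\sum_{j=0}^{N-1}\Bigl[\frac{m(x_{j+1}-x_j)^2}{2\epsilon} - \epsilon V(x_j)\Bigr] \to \frac{i}{\hbar}\int_{t_0}^{t_N} dt\,\Bigl[\frac{m}{2}\Bigl(\frac{dx}{dt}\Bigr)^2 - V(x)\Bigr] = \frac{i}{\hbar}\int_{t_0}^{t_N} dt\,L, \tag{11.15}$$
where $L$ is the classical Lagrangian. This is the desired result. We now define the path integral formally through the limiting process
$$\int[\mathcal{D}x]\,e^{iS[x(t)]/\hbar} := \lim_{\substack{\epsilon\to 0,\ N\to\infty \\ N\epsilon = t_N - t_0}}\Bigl(\frac{m}{2\pi i\hbar\epsilon}\Bigr)^{N/2}\int\prod_{k=1}^{N-1} dx_k\,\exp\Bigl\{\frac{i}{\hbar}\sum_{j=0}^{N-1}\Bigl[\frac{m(x_{j+1}-x_j)^2}{2\epsilon} - \epsilon V(x_j)\Bigr]\Bigr\}. \tag{11.16}$$
The factor in front of the product of integrals is required to make sure that the limit exists.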


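Now we will recalculate the free particle propagator. Let us choose the number of steps to be a power of two, $N = 2^n$, and let us define
$$K_N = \Bigl(\frac{mN}{2\pi i\hbar t}\Bigr)^{N/2}\int\prod_{k=1}^{N-1} dx_k\,\exp\Bigl\{\frac{imN}{2\hbar t}\sum_{j=0}^{N-1}(x_{j+1} - x_j)^2\Bigr\}. \tag{11.17}$$
Again, we have $\epsilon = t/N$. Expanding all of the squares, we see that
$$\sum_{j}(x_{j+1} - x_j)^2 = x_0^2 + 2x_1^2 + 2x_2^2 + \cdots + 2x_{N-1}^2 + x_N^2 - 2x_0x_1 - 2x_1x_2 - \cdots - 2x_{N-1}x_N. \tag{11.18}$$
We now carry out all of the odd integrals first, which will cause the problem to simplify. We find
$$K_N = \Bigl(\frac{mN}{2\pi i\hbar t}\Bigr)^{N/2}\int\prod_{j\ \mathrm{even}} dx_j\int\prod_{k\ \mathrm{odd}} dx_k\,\exp\Bigl\{\frac{imN}{2\hbar t}\sum_{k\ \mathrm{odd}}\bigl[2x_k^2 + x_{k-1}^2 + x_{k+1}^2 - 2x_k(x_{k-1} + x_{k+1})\bigr]\Bigr\}$$
$$= \Bigl(\frac{mN}{2\pi i\hbar t}\Bigr)^{N/2}\Bigl(\frac{i\pi\hbar t}{mN}\Bigr)^{N/4}\int\prod_{j\ \mathrm{even}} dx_j\,\exp\Bigl\{\frac{imN}{2\hbar t}\sum_{k\ \mathrm{odd}}\Bigl[x_{k-1}^2 + x_{k+1}^2 - \frac{1}{2}(x_{k-1} + x_{k+1})^2\Bigr]\Bigr\}$$
$$= \Bigl(\frac{m2^{n-1}}{2\pi i\hbar t}\Bigr)^{2^{n-2}}\int\prod_{\ell=1}^{2^{n-1}-1} dx_{2\ell}\,\exp\Bigl\{\frac{im2^{n-1}}{2\hbar t}\sum_{\ell=0}^{2^{n-1}-1}(x_{2\ell+2} - x_{2\ell})^2\Bigr\} = K_{N/2}. \tag{11.19}$$
We see then that
$$K_{2^n} = K_{2^{n-1}} = \cdots = K_1 = \sqrt{\frac{m}{2\pi i\hbar t}}\,e^{\frac{im}{2\hbar t}(x_N - x_0)^2}. \tag{11.20}$$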


Thus, we have recovered the expected result. 
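The halving step $K_N = K_{N/2}$ rests on the fact that integrating out an intermediate point merges two short-time kernels into one kernel of twice the duration. The real-time integrals are oscillatory and awkward to evaluate numerically, so the sketch below checks the analogous statement after continuing to imaginary time, $t - t' \to -i\tau$, where Eq. (10.10) becomes a decaying Gaussian. The imaginary-time continuation, the integration window, and the function names are assumptions of this sketch.

```python
import math


def heat_kernel(x, y, tau, m=1.0, hbar=1.0):
    """Free kernel of Eq. (10.10) continued to imaginary time, t - t' -> -i tau."""
    return (math.sqrt(m / (2 * math.pi * hbar * tau))
            * math.exp(-m * (x - y) ** 2 / (2 * hbar * tau)))


def compose(x, z, tau, half_width=12.0, n=4801):
    """Trapezoid estimate of the integral over y of K_tau(x, y) K_tau(y, z)."""
    dy = 2 * half_width / (n - 1)
    total = 0.0
    for k in range(n):
        y = -half_width + k * dy
        weight = 0.5 if k in (0, n - 1) else 1.0
        total += weight * heat_kernel(x, y, tau) * heat_kernel(y, z, tau)
    return total * dy


# Two steps of duration tau should reproduce one step of duration 2 tau.
print(compose(0.3, -0.5, 0.4), heat_kernel(0.3, -0.5, 0.8))
```

The two printed numbers agree to high accuracy, including the prefactor. This is the role played by the normalization $(m/2\pi i\hbar\epsilon)^{N/2}$ in Eq. (11.16): without it the composition would not close.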

We can explicitly evaluate the path integral for any system in which the potential is at most quadratic in $x$ and $\dot{x}$. We will see several examples on the homework.

The path integral formulation seems unwieldy. Why does anyone bother with this? This is a 
completely different way of formulating quantum mechanics than we have seen previously, but it 
is completely equivalent to our other formulations, so what is the use? Feynman emphasized that 
different formulations, even though they are mathematically equivalent, may each be useful for 
describing different systems. Furthermore, we can use a given formulation to generalize quantum 
mechanics, and the different formalisms may not be equivalent in these generalizations. Finally, 
as Feynman says, it feels psychologically different to be thinking about the same physics from 
a completely different perspective. In practice, path integrals are not usually a practical way of 
solving problems. However, the path integral gives us quantum intuition, which allows us to say 
many things about how systems will behave, even if we cannot solve them exactly. 


11.1.1 Technical Details 


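For each time step, we wrote
$$\langle x_{j+1}|e^{-i\epsilon H/\hbar}|x_j\rangle \approx \sqrt{\frac{m}{2\pi i\hbar\epsilon}}\,\exp\Bigl\{\frac{i\epsilon m}{2\hbar}\Bigl(\frac{x_{j+1} - x_j}{\epsilon}\Bigr)^2 - \frac{i\epsilon}{\hbar}V(x_j)\Bigr\}. \tag{11.21}$$
It seems arbitrary that we wrote
$$e^{-i\epsilon(T+V)/\hbar} \approx e^{-i\epsilon T/\hbar}\,e^{-i\epsilon V/\hbar} \tag{11.22}$$
instead of
$$e^{-i\epsilon(T+V)/\hbar} \approx e^{-i\epsilon V/\hbar}\,e^{-i\epsilon T/\hbar}, \tag{11.23}$$
in which case we would have had $V(x_{j+1})$ in the exponent of Eq. (11.21) instead of $V(x_j)$. Either is a perfectly good choice to the order at which we are working. A more symmetric choice is to use $V\bigl(\frac{x_j + x_{j+1}}{2}\bigr)$.

This choice is very important for the particle in a magnetic field described by a vector potential $\mathbf{A}$ (through $\mathbf{B} = \nabla\times\mathbf{A}$). In this system, the Lagrangian is
$$L = \frac{1}{2}m\dot{\mathbf{x}}^2 + e\mathbf{A}\cdot\frac{d\mathbf{x}}{dt} - V(\mathbf{x}). \tag{11.24}$$
Here, in the quantum mechanical path integral we must use
$$\mathbf{A} \to \mathbf{A}\Bigl(\frac{\mathbf{x}_j + \mathbf{x}_{j+1}}{2}\Bigr). \tag{11.25}$$
See the homework for more on this.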




Lecture 12 (Oct. 23, 2017) 


12.1 Stationary Phase Approximation 


Consider an integral of the form
$$\int dx\,e^{i\lambda f(x)}, \tag{12.1}$$
with $\lambda$ large and $f(x)$ real. Can we use the largeness of $\lambda$ in order to help evaluate or approximate the integral? Let $f(x)$ have an extremum at $x = x_0$. The claim is that the integral will be dominated by the region around $x_0$. In most regions of integration, the complex exponential oscillates rapidly as a function of $x$, and so the integral over any range will sum many phases that cancel out. The integrand near $x_0$, however, varies slowly, because $f(x)$ achieves an extremum at $x_0$. We thus expect that the integral is dominated by $x$ near $x_0$ (or, in general, near all $x_0$ for which $f(x)$ achieves an extremum).
More precisely, near $x_0$, we can write
$$f(x) \approx f(x_0) + \frac{1}{2}f''(x_0)(x - x_0)^2 + \cdots, \tag{12.2}$$
where the first-order term vanishes because $f(x)$ has an extremum at $x_0$. We can then write
$$\int dx\,e^{i\lambda f(x)} \approx e^{i\lambda f(x_0)}\int d(\delta x)\,e^{i\lambda f''(x_0)(\delta x)^2/2 + \cdots}. \tag{12.3}$$
If we assume that $\delta x$ is small, we can drop the higher order terms in the exponential. However, we can then extend the limits of integration over $\delta x$ to $\pm\infty$, because the largeness of $\lambda$ leads to rapid oscillation for large $\delta x$, giving cancellation of the exponentials. This yields
$$\int dx\,e^{i\lambda f(x)} \approx e^{i\lambda f(x_0)}\int_{-\infty}^{\infty} d(\delta x)\,e^{i\lambda f''(x_0)(\delta x)^2/2} = e^{i\lambda f(x_0)}\sqrt{\frac{2\pi i}{\lambda f''(x_0)}}\,\bigl(1 + O(\lambda^{-1})\bigr). \tag{12.4}$$
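Equation (12.4) can be tested on an integral that is easy to evaluate numerically. The sketch below uses $f(x) = \cos x$ integrated over one period, $-\pi \le x < \pi$, which has two stationary points ($x_0 = 0$ and $x_0 = \pi$); this test function, the grid size, and the function names are choices made here for illustration. The exact value of this integral is $2\pi J_0(\lambda)$, with $J_0$ a Bessel function, a standard identity quoted here rather than derived.

```python
import cmath
import math


def oscillatory_integral(lam, n=4096):
    """Riemann sum for the integral of exp(i lam cos x) over one period [-pi, pi).
    For a smooth periodic integrand this sum converges very quickly in n."""
    dx = 2 * math.pi / n
    return sum(cmath.exp(1j * lam * math.cos(-math.pi + k * dx))
               for k in range(n)) * dx


def stationary_phase_estimate(lam):
    """Sum of the leading term of Eq. (12.4) over both stationary points of cos x."""
    points = [(1.0, -1.0),   # x0 = 0:  f(x0) = 1,  f''(x0) = -1
              (-1.0, 1.0)]   # x0 = pi: f(x0) = -1, f''(x0) = +1
    return sum(cmath.exp(1j * lam * f0) * cmath.sqrt(2j * math.pi / (lam * f2))
               for f0, f2 in points)


lam = 50.0
print(oscillatory_integral(lam), stationary_phase_estimate(lam))
```

At $\lambda = 50$ the two values agree to a fraction of a percent of the overall scale $2\sqrt{2\pi/\lambda}$, consistent with the $O(\lambda^{-1})$ correction. The estimate is a sum of two phases, $e^{\pm i(\lambda - \pi/4)}$, so the result already shows interference between the contributions of different stationary points.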


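We can generalize this to multidimensional integrals, and even path integrals. We thus similarly expect that
$$\int[\mathcal{D}x]\,e^{iS[x(t)]/\hbar} \tag{12.5}$$
is dominated by $x(t) = x_{\mathrm{cl}}(t)$, the classical path, where
$$\frac{\delta S}{\delta x}\Big|_{x_{\mathrm{cl}}} = 0, \tag{12.6}$$
when $\hbar \to 0$. Then we get
$$K(x', t'; x, t) \approx e^{iS[x_{\mathrm{cl}}(t)]/\hbar}. \tag{12.7}$$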

Note that this not only gives us the classical limit, but also gives us interference effects in the 
near-classical limit: if there are multiple possible classical paths, then in the near-classical limit, 
these paths can exhibit interference with one another. 




12.2 Quantum Particles in Electromagnetic fields 


We now discuss the motion of quantum particles in electromagnetic fields. In general, we should 
assume that every system is quantum mechanical. However, here we will be using a semiclassical 
approach, in which we will consider a classical background electromagnetic field with quantum 
particles moving in it. There are many phenomena which require the quantization of the electric 
field as well as the particles moving in it, but we will not discuss such phenomena here. 


12.2.1 Constant Potentials 


We start with a potential $V$. If we change the potential by $V \to V + V_0$, this is a shift in the overall
energy, and should have no effect on the particle motion. Classically, this is true because particles
move due to forces, not potentials. The force is computed by taking the gradient of the potential,
which will not be affected by the shift. In quantum mechanics, the shift in the potential results in
an extra phase under time evolution,

$$ |\psi(t)\rangle \to e^{-iV_0 t/\hbar}\, |\psi(t)\rangle. \tag{12.8} $$

Thus, the shift serves to change the phase of the wavefunction at each time step. We know that 
such an overall phase shift is not observable. 

However, if we change the potential by a constant shift in one region of space but not in another, 
then we do expect to see an observable effect. Quantum mechanically, we can interfere particles 
that have traveled through this region with those that have not. 

Imagine that we have a beam of particles being emitted from a source A, and two metallic cages 
that the beam could pass through, before passing to a detector at B. Assume that the potential is 
$V_1$ in the bottom cage and $V_2$ in the top cage, both constant. The potential is constant within each
cage, and so the particles passing through either cage experience no forces. Quantum mechanically,
there is nevertheless a difference in the phase factor accumulated while passing through the two
different cages:

$$ \phi_2 - \phi_1 = \frac{1}{\hbar} \int_{t_i}^{t_f} dt\, (V_2 - V_1). \tag{12.9} $$


Let us assume that the two metallic cages are completely identical other than the described potential 
difference, and that A and B are arranged symmetrically around the two cages. We thus expect, 
from symmetry, that the amplitude for a particle to travel from A to B would be the same through 
either cage in the absence of the potential difference. If we let $a$ be the amplitude to go from $A$ to $B$
through the bottom cage in time $t_f - t_i$, then the amplitude to go through the top cage must be

$$ a\, e^{-i(\phi_2 - \phi_1)}. \tag{12.10} $$

The full amplitude to travel from $A$ to $B$ is then

$$ a \left( 1 + e^{-i(\phi_2 - \phi_1)} \right), \tag{12.11} $$

giving a probability of detection of

$$ \mathrm{Prob}(A \to B) = 2|a|^2 \left( 1 + \cos(\phi_2 - \phi_1) \right). \tag{12.12} $$


Thus, the constant potential difference between the two cages changes physical observables; this is 
a purely quantum mechanical effect, as the particles experienced no forces during this experiment 
(neglecting the small forces when entering and exiting the regions of shifted potential). Note that 
in the limit $\hbar \to 0$, the phase difference (12.9) oscillates very rapidly. When averaged over any
nonzero time interval, these rapidly oscillating phases cancel out, so the interference effects will 
disappear, as we expect in the classical limit. 
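
The two-cage interference can be sketched numerically as follows. This is a minimal illustration under stated assumptions: units with $\hbar = 1$, constant potentials over a common transit time, and hypothetical function names that do not come from the lecture.

```python
import cmath
import math

HBAR = 1.0  # assumed units with hbar = 1


def phase_difference(v1, v2, transit_time, hbar=HBAR):
    """phi_2 - phi_1 of Eq. (12.9) for constant potentials v1 (bottom) and v2 (top)."""
    return (v2 - v1) * transit_time / hbar


def detection_probability(a, v1, v2, transit_time, hbar=HBAR):
    """|a (1 + exp(-i (phi_2 - phi_1)))|^2, the squared modulus of Eq. (12.11)."""
    dphi = phase_difference(v1, v2, transit_time, hbar)
    amplitude = a * (1 + cmath.exp(-1j * dphi))
    return abs(amplitude) ** 2
```

With equal potentials the two paths add constructively and the probability is $4|a|^2$; when $(V_2 - V_1)(t_f - t_i)/\hbar = \pi$ the paths cancel, even though no force acts inside either cage.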




12.2.2 Electromagnetic Fields 


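Let us briefly review electromagnetism. We know that we can represent electric fields $\mathbf{E}$ and
magnetic fields $\mathbf{B}$ through scalar and vector potentials as

$$ \mathbf{E} = -\nabla\phi - \frac{1}{c} \frac{\partial \mathbf{A}}{\partial t}, \qquad \mathbf{B} = \nabla \times \mathbf{A}. \tag{12.13} $$

Note that we are using cgs units. The potentials $\phi$ and $\mathbf{A}$ will be extremely useful in describing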
the motion of quantum mechanical particles through EM fields. 

It is useful to introduce a more formal notation, defining the electromagnetic field strength 
tensor 


$$ F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu, \tag{12.14} $$

with

$$ A_\mu = (\phi, \mathbf{A}), \qquad (\mu = 0, 1, 2, 3). \tag{12.15} $$

The electric and magnetic fields are then given by

$$ F_{0i} = -F_{i0} = -E_i, \qquad F_{ij} = \epsilon_{ijk} B_k. \tag{12.16} $$

The four-vector potential $A_\mu$ is redundant: the gauge transformation

$$ A_\mu \to A_\mu + \partial_\mu \Lambda \tag{12.17} $$

leaves the field strength $F_{\mu\nu}$ invariant, which means the electric and magnetic fields are unchanged.
The field strength tensor is the real physical quantity, and all of the redundant values of $A_\mu$ that
yield the same field strength tensor describe the same physics.

For a charged particle in an electromagnetic field, the Lagrangian is 


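$$ L = \frac{1}{2} m \dot{\mathbf{x}}^2 + \frac{e}{c}\, \mathbf{A} \cdot \dot{\mathbf{x}} - e\phi. \tag{12.18} $$

In Lagrangian mechanics, which is formulated in terms of position and velocity, there is a notion
of momentum as the variable conjugate to the position. The canonical momentum is given by

$$ \mathbf{p} = \frac{\partial L}{\partial \dot{\mathbf{x}}} = m\dot{\mathbf{x}} + \frac{e}{c} \mathbf{A}. \tag{12.19} $$
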
Note that the canonical momentum is not gauge-invariant; it depends on the choice of gauge. 
Using this canonical momentum, we find that the Hamiltonian is given by 


$$ H = \mathbf{p} \cdot \dot{\mathbf{x}} - L = \frac{1}{2} m \dot{\mathbf{x}}^2 + e\phi. \tag{12.20} $$

This is the expected answer; the Hamiltonian is gauge-invariant. Using the definition of the canonical
momentum, we can rewrite this as

$$ H = \frac{\left( \mathbf{p} - \frac{e}{c} \mathbf{A} \right)^2}{2m} + e\phi. \tag{12.21} $$

The main conceptual point here is that the canonical momentum $\mathbf{p}$ is different from $m\mathbf{v} = m\dot{\mathbf{x}}$
(the “kinematic” momentum) when $\mathbf{A} \neq 0$. We care about the canonical momentum
rather than the kinematic momentum because it is the quantity that has $\{x, p\} = 1$ (with $\{\cdot,\cdot\}$
the Poisson bracket), which means that it will have the proper commutator with the operator $x$
when we quantize the theory.

Quantum mechanically, we use the Hamiltonian (12.21) to discuss quantum motion, using the 
commutators 


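$$ [x_i, p_j] = i\hbar\, \delta_{ij}, \qquad [x_i, x_j] = [p_i, p_j] = 0. \tag{12.22} $$

Expanding Eq. (12.21), we have

$$ H = \frac{\mathbf{p}^2}{2m} - \frac{e}{2mc} \left( \mathbf{p} \cdot \mathbf{A} + \mathbf{A} \cdot \mathbf{p} \right) + \frac{e^2}{2mc^2} \mathbf{A}^2 + e\phi. \tag{12.23} $$
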
Note that the ordering of p and A matters here, as they may not commute. 


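12.2.3 Gauge Invariance in Quantum Mechanics


The Schrödinger equation is

$$ i\hbar \frac{\partial \psi}{\partial t} = \left[ \frac{\left( \mathbf{p} - \frac{e}{c} \mathbf{A} \right)^2}{2m} + e\phi \right] \psi = \left[ \frac{\left( -i\hbar\nabla - \frac{e}{c} \mathbf{A} \right)^2}{2m} + e\phi \right] \psi. \tag{12.24} $$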


How should we change the wavefunction so that the theory remains invariant under gauge trans- 
formations? 


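Let

$$ \mathbf{A}' = \mathbf{A} + \nabla\Lambda, \qquad \phi' = \phi - \frac{1}{c} \frac{\partial \Lambda}{\partial t}, \tag{12.25} $$

which is a gauge transformation, where $\Lambda(\mathbf{x}, t)$ is a scalar function. Then we have

$$ i\hbar \frac{\partial \psi}{\partial t} = \left[ \frac{\left( \mathbf{p} - \frac{e}{c}\mathbf{A} \right)^2}{2m} + e\phi \right] \psi = \left[ \frac{\left( -i\hbar\nabla - \frac{e}{c}\mathbf{A}' + \frac{e}{c}\nabla\Lambda \right)^2}{2m} + e\phi' + \frac{e}{c} \frac{\partial\Lambda}{\partial t} \right] \psi. \tag{12.26} $$

We can rearrange this equation to reach

$$ \left( i\hbar \frac{\partial}{\partial t} - e\phi' - \frac{e}{c} \frac{\partial\Lambda}{\partial t} \right) \psi = \frac{\left( -i\hbar\nabla - \frac{e}{c}\mathbf{A}' + \frac{e}{c}\nabla\Lambda \right)^2}{2m}\, \psi. \tag{12.27} $$

We can compensate for the changes from the gauge transformation by letting

$$ \psi' = e^{ie\Lambda/\hbar c}\, \psi, \tag{12.28} $$

which leaves us with

$$ \left( i\hbar \frac{\partial}{\partial t} - e\phi' \right) \psi' = \frac{\left( -i\hbar\nabla - \frac{e}{c}\mathbf{A}' \right)^2}{2m}\, \psi', \tag{12.29} $$

matching the form of the Schrödinger equation in Eq. (12.24).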




Thus, the full gauge transformation is 


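$$ \mathbf{A}' = \mathbf{A} + \nabla\Lambda, \qquad \phi' = \phi - \frac{1}{c} \frac{\partial\Lambda}{\partial t}, \qquad \psi' = e^{ie\Lambda/\hbar c}\, \psi. \tag{12.30} $$
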
All observables must remain invariant under this transformation. 
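
The key step behind this invariance is that the kinetic momentum operator transforms covariantly: $(-i\hbar\nabla - \frac{e}{c}\mathbf{A}')\psi' = e^{ie\Lambda/\hbar c}(-i\hbar\nabla - \frac{e}{c}\mathbf{A})\psi$. The sketch below checks this numerically in one dimension. The particular wavefunction, vector potential, and gauge function are hypothetical choices made only for the example, as are the units $\hbar = e = c = 1$ and the finite-difference derivative.

```python
import cmath
import math

HBAR, E, C = 1.0, 1.0, 1.0  # assumed units


def derivative(func, x, h=1e-5):
    """Central finite-difference derivative (works for complex-valued functions)."""
    return (func(x + h) - func(x - h)) / (2 * h)


def psi(x):
    return cmath.exp(-x**2 + 2j * x)  # hypothetical wavefunction


def vec_pot(x):
    return math.sin(x)  # hypothetical vector potential component


def gauge_fn(x):
    return 0.3 * x**3 + x  # hypothetical gauge function Lambda(x)


def gauge_fn_prime(x):
    return 0.9 * x**2 + 1.0  # exact derivative of gauge_fn


def vec_pot_prime(x):
    return vec_pot(x) + gauge_fn_prime(x)  # A' = A + dLambda/dx


def psi_prime(x):
    return cmath.exp(1j * E * gauge_fn(x) / (HBAR * C)) * psi(x)  # Eq. (12.28)


def kinetic_momentum(wave, potential, x):
    """(-i hbar d/dx - (e/c) A) acting on wave, evaluated at x."""
    return -1j * HBAR * derivative(wave, x) - (E / C) * potential(x) * wave(x)


def covariance_mismatch(x):
    """Difference between the two sides of the covariance identity at x."""
    lhs = kinetic_momentum(psi_prime, vec_pot_prime, x)
    rhs = cmath.exp(1j * E * gauge_fn(x) / (HBAR * C)) * kinetic_momentum(psi, vec_pot, x)
    return abs(lhs - rhs)
```

The mismatch should vanish up to finite-difference error, which is why quantities such as $|\psi|^2$ and the kinetic momentum density are unchanged by the transformation (12.30).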


12.2.4 Aharonov-Bohm Effect


Consider a hollow cylindrical shell with a solenoid through the middle, such that there is a flux

$$ \Phi = \int_{\rm core} \mathbf{B} \cdot d\mathbf{S} \tag{12.31} $$

through the core, but nowhere in the shell. Suppose that some charged particles are constrained 
to move within the shell, where there is no magnetic field. Classically, the magnetic field does 
not affect the motion of the charged particles, because they experience no forces. What about in 
quantum mechanics? 

In order to formulate the quantum motion, we need to determine the vector potential A (note 
that the scalar potential is zero in this case). Even though the magnetic field is zero in the shell 
where the particles move, the vector potential cannot be chosen to vanish in this region. To see 
this, consider taking the line integral of $\mathbf{A}$ around a closed contour $C$ lying within the shell and
enclosing the core. Using Stokes’ theorem, we see that this line integral is given by

$$ \oint_C \mathbf{A} \cdot d\boldsymbol{\ell} = \int_\Sigma (\nabla \times \mathbf{A}) \cdot d\mathbf{S} = \int_\Sigma \mathbf{B} \cdot d\mathbf{S} = \Phi, \tag{12.32} $$

where $\Sigma$ is the area bounded by the contour $C$, i.e., $C = \partial\Sigma$. Because there is nonzero flux through
the core, this integral cannot be zero. Thus, we cannot set $\mathbf{A}$ to zero everywhere in the shell.




Lecture 13 (Oct. 25, 2017) 


13.1 Aharonov-Bohm Effect


Recall from last lecture the setup for the Aharonov-Bohm effect: we have a cylindrical shell through
which runs a solenoid, producing a flux

$$ \Phi = \int_{\rm core} \mathbf{B} \cdot d\mathbf{S} \tag{13.1} $$

through the core of the shell. There is no magnetic field inside the cylindrical shell, only through
its core. As we saw, even though the magnetic field within the shell vanishes, it is impossible to
choose the vector potential $\mathbf{A}$ to vanish inside the cylindrical shell. To see why, we took a closed
contour $C$ within the shell that enclosed the core, and noted that

$$ \oint_C \mathbf{A} \cdot d\boldsymbol{\ell} = \int_\Sigma (\nabla \times \mathbf{A}) \cdot d\mathbf{S} = \int_\Sigma \mathbf{B} \cdot d\mathbf{S} = \Phi, \tag{13.2} $$

where $\Sigma$ is the area bounded by the contour $C$, i.e., $C = \partial\Sigma$. This shows us that $\mathbf{A}$ cannot vanish
everywhere within the shell.

In particular, let us consider the contour $C$ to be a circular loop of radius $R$. The line integral
above has contributions only from the azimuthal component of the vector potential. Thus, we
should be able to produce the magnetic field coming from the solenoid by taking a vector potential
of the form

$$ \mathbf{A} = A_\theta\, \hat{\mathbf{e}}_\theta. \tag{13.3} $$

By cylindrical symmetry, it is reasonable to take $A_\theta$ to be independent of $\theta$. We then have

$$ \oint_C \mathbf{A} \cdot d\boldsymbol{\ell} = A_\theta\, (2\pi R) = \Phi, \tag{13.4} $$

which yields

$$ A_\theta = \frac{\Phi}{2\pi R}. \tag{13.5} $$

Thus, a vector potential that produces the desired magnetic field is

$$ \mathbf{A} = \frac{\Phi}{2\pi r}\, \hat{\mathbf{e}}_\theta, \tag{13.6} $$


where r is the radial cylindrical coordinate. Note that there are infinitely many vector potentials 
that will produce the same magnetic field, all related to one another by gauge transformations. 

Because we have a nonzero vector potential, there will be nontrivial quantum mechanical behavior.
Consider the propagator

$$ K(x_f, t_f; x_i, t_i) = \int [Dx]\, e^{iS[x(t)]/\hbar}. \tag{13.7} $$

The action is

$$ S[x(t)] = \int dt\, L(\mathbf{x}, \dot{\mathbf{x}}) = \int dt \left( \frac{1}{2} m \dot{\mathbf{x}}^2 + \frac{e}{c}\, \mathbf{A} \cdot \dot{\mathbf{x}} \right) = S_0 + \frac{e}{c} \int \mathbf{A} \cdot d\mathbf{x}, \tag{13.8} $$

where $S_0$ is the action without the presence of any magnetic field. Thus, the amplitude to propagate
from $(x_i, t_i)$ to $(x_f, t_f)$ is equal to the amplitude with no magnetic field multiplied by the phase

$$ \exp\!\left( \frac{ie}{\hbar c} \int \mathbf{A} \cdot d\mathbf{x} \right). \tag{13.9} $$

So far this statement is general; let us now specialize to the case of the Aharonov-Bohm setup. 
Consider two points A and B distributed symmetrically around the cylindrical shell, and both lying 
in the same plane orthogonal to the axis of the cylindrical shell. We consider two types of paths from
A to B within this plane: those passing around the left of the cylindrical shell, and those passing 
around the right of the cylindrical shell. Note that our choice of a particular path around the left 
of the shell does not affect the phase accumulated along this path. To see this, note that we could 
choose two such paths and combine them to form a closed loop which encloses no magnetic flux; 
the closed line integral of A along this loop therefore vanishes, showing that the phase picked up 
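on the two paths must be equal. In other words, for any two paths $P_L$ and $P_L'$ passing around the
left, we have

$$ \int_{P_L} \mathbf{A} \cdot d\mathbf{x} = \int_{P_L'} \mathbf{A} \cdot d\mathbf{x}. \tag{13.10} $$

This is similarly true for the paths around the right of the shell.

The amplitudes when traveling along the left and right paths $P_L$ and $P_R$ are therefore

$$ a(P_L) = a_0(P_L)\, e^{\frac{ie}{\hbar c} \int_{P_L} \mathbf{A} \cdot d\mathbf{x}}, \qquad a(P_R) = a_0(P_R)\, e^{\frac{ie}{\hbar c} \int_{P_R} \mathbf{A} \cdot d\mathbf{x}}, \tag{13.11} $$

where $a_0$ denotes the amplitude in the absence of the magnetic field. If we consider the difference
of the additional phase factors, we have

$$ \int_{P_R} \mathbf{A} \cdot d\mathbf{x} - \int_{P_L} \mathbf{A} \cdot d\mathbf{x} = \oint \mathbf{A} \cdot d\mathbf{x} = \Phi. \tag{13.12} $$

The total propagator is thus

$$ K = K_L\, e^{\frac{ie}{\hbar c} \int_{P_L} \mathbf{A} \cdot d\mathbf{x}} + K_R\, e^{\frac{ie}{\hbar c} \int_{P_R} \mathbf{A} \cdot d\mathbf{x}} = e^{\frac{ie}{\hbar c} \int_{P_L} \mathbf{A} \cdot d\mathbf{x}} \left( K_L + K_R\, e^{ie\Phi/\hbar c} \right), \tag{13.13} $$

where $K_L$ and $K_R$ are the field-free contributions from paths passing around the left and right,
respectively. Taking $K_L$ and $K_R$ to have the same phase, as in the symmetric arrangement, the
probability of propagating from point $A$ to point $B$ is then

$$ \mathrm{Prob}(A \to B) = |K_L|^2 + |K_R|^2 + 2|K_L||K_R| \cos\!\left( \frac{e\Phi}{\hbar c} \right). \tag{13.14} $$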


This probability oscillates as a function of the flux $\Phi$, even though classical trajectories do not see
the magnetic field. We see that this oscillation depends only on the flux $\Phi$, and not on our choice
of gauge, as is required by gauge invariance.

We have derived this effect using the path integral approach. We could have also reached this
result by using the Schrödinger equation. The energy levels and wavefunctions for a particle in a
ring threaded by flux $\Phi$ are dependent on $\Phi$.
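
The flux dependence of the detection probability can be sketched as below. This is only an illustration of Eq. (13.13): the units $\hbar = e = c = 1$ and the function name are assumptions, and `k_left`, `k_right` stand for the field-free amplitudes around the two sides, supplied by the caller.

```python
import cmath
import math

HBAR, E, C = 1.0, 1.0, 1.0  # assumed units
FLUX_QUANTUM = 2 * math.pi * HBAR * C / E  # hc/e


def ab_probability(k_left, k_right, flux):
    """|K_L + K_R exp(i e Phi / (hbar c))|^2; the overall phase in Eq. (13.13) drops out."""
    return abs(k_left + k_right * cmath.exp(1j * E * flux / (HBAR * C))) ** 2
```

The result is periodic in the flux with period $hc/e$: for equal amplitudes it is maximal at zero flux and vanishes at half a flux quantum, independent of which gauge was used to write $\mathbf{A}$.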

This result leads us to the following question: the vector potential is a (redundant) mathematical
construct that we use to describe the physics, but the actual physical quantities are the
electromagnetic fields. How can it be the case that the vector potential makes an imprint on the
physics, when we know that the vector potential by itself is not gauge invariant? To reconcile the
Aharonov-Bohm effect and the non-reality of the vector potential, we must realize that the physics
of a quantum particle is sensitive not only to the local magnetic field, but also the global properties 
of the magnetic field. This is a departure from the classical intuition: quantum mechanics requires 
nonlocal effects. Formulating the theory in the way that we have, using the vector potential, allows 
us to define the theory only locally and find the same results. We have formulated these nonlo- 
cal effects in terms of a purely local description, but the price we pay is that there is no unique 
description of the formalism, because we have gauge redundancy. 

The same approach can be seen in quantum field theory. With quantum electrodynamics, 
we describe the physics of charged particles and photons. We all know that two charged particles 
interact via the Coulomb interaction, which is nonlocal; we can think of the introduction of photons 
as taming this nonlocality in the same way as the introduction of the vector potential tamed the 
nonlocality in the Aharonov-Bohm setup. 


13.2 Magnetic Monopoles 


Suppose that magnetic monopoles exist. This requires a modification to Gauss’s law for magnetism, 
$$ \nabla \cdot \mathbf{B} = 0 \;\to\; \nabla \cdot \mathbf{B} = 4\pi\rho_M, \tag{13.15} $$

where $\rho_M$ is the density of magnetic charge (as well as modifications to Faraday’s law and the
Lorentz force law). A magnetic monopole is a single magnetic charge of strength $g$. If there is a
magnetic monopole of magnetic charge $g$ at the origin, then

$$ \mathbf{B} = \frac{g}{r^2}\, \hat{\mathbf{r}}. \tag{13.16} $$


What is the fate of a quantum mechanical particle of electric charge e moving in the presence 
of a magnetic monopole? We immediately realize a problem. We typically describe the magnetic 
field in quantum mechanics via the vector potential, but the existence of the vector potential is 
predicated on the fact that the divergence of the magnetic field vanishes. In a world where magnetic 
monopoles exist, the magnetic field does not have a representation as the curl of a vector potential. 

Let’s think about this problem in the presence of a single magnetic monopole sitting at the 
origin. In this situation, $\nabla \cdot \mathbf{B} = 0$ everywhere except at the origin. The implication is that if we
exclude the point at the origin, then we can locally find a vector potential $\mathbf{A}$ such that $\mathbf{B} = \nabla \times \mathbf{A}$.
To see more clearly why we cannot describe the magnetic field everywhere with a vector potential,
consider a sphere of radius $r$ centered at the origin. Due to the presence of the magnetic monopole
at the origin, we see that

$$ \oint_{\rm sphere} \mathbf{B} \cdot d\mathbf{S} = 4\pi g. \tag{13.17} $$

If we were able to write $\mathbf{B} = \nabla \times \mathbf{A}$ with $\mathbf{A}$ well-defined everywhere, then we would have

$$ \oint_{\rm sphere} \mathbf{B} \cdot d\mathbf{S} = \int \nabla \cdot \mathbf{B}\, dV = \int \nabla \cdot (\nabla \times \mathbf{A})\, dV = 0, \tag{13.18} $$

which yields a contradiction.
Consider a circle $C$ on the surface of the sphere at polar angle $\theta$. The circle divides the sphere into
two caps; the magnetic flux through the upper cap of the sphere bounded by this circle is

$$ \Phi = 2\pi \int_{\cos\theta}^{1} \frac{g}{r^2}\, r^2\, d(\cos\theta') = 2\pi g (1 - \cos\theta). \tag{13.19} $$




If there exists a vector potential 


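$$ \mathbf{A} = A_\phi\, \hat{\mathbf{e}}_\phi, \tag{13.20} $$

then

$$ \oint_C \mathbf{A} \cdot d\boldsymbol{\ell} = 2\pi r \sin\theta\, A_\phi. \tag{13.21} $$

On the other hand,

$$ \oint_C \mathbf{A} \cdot d\boldsymbol{\ell} = \int_\Sigma \mathbf{B} \cdot d\mathbf{S} = 2\pi g (1 - \cos\theta), \tag{13.22} $$

where $\Sigma$ is the upper cap bounded by $C$, i.e., $C = \partial\Sigma$. Thus, we have

$$ A_\phi = \frac{g (1 - \cos\theta)}{r \sin\theta}. \tag{13.23} $$

Note that this ratio is well-behaved as $\theta \to 0$, because the numerator goes to zero more quickly
than the denominator, but the ratio blows up as $\theta \to \pi$.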




Lecture 14 (Oct. 30, 2017) 


14.1 Magnetic Monopoles 


Last time, we considered a magnetic field with a magnetic monopole configuration, and began to 
approach describing the quantum mechanics of a charged particle moving in such a magnetic field. 
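The presence of the monopole violates the condition $\nabla \cdot \mathbf{B} = 0$, which means that we cannot globally
define a vector potential $\mathbf{A}$ such that $\mathbf{B} = \nabla \times \mathbf{A}$.
We began by considering an isolated magnetic monopole of strength $g$ located at the origin,
which gives a magnetic field

$$ \mathbf{B} = \frac{g}{r^2}\, \hat{\mathbf{r}}. \tag{14.1} $$

Consider a sphere of radius $r$, and a closed circular contour $C$ at polar angle $\theta$ on the sphere. The magnetic
flux through the upper cap $\Sigma$ bounded by $C$ is

$$ \Phi_C = 2\pi g (1 - \cos\theta). \tag{14.2} $$

If we choose

$$ \mathbf{A} = A_\phi\, \hat{\mathbf{e}}_\phi, \tag{14.3} $$

then we have

$$ \oint_C \mathbf{A} \cdot d\boldsymbol{\ell} = 2\pi r \sin\theta\, A_\phi = \int_\Sigma \mathbf{B} \cdot d\mathbf{S} = 2\pi g (1 - \cos\theta), \tag{14.4} $$

which gives us

$$ A_\phi = \frac{g (1 - \cos\theta)}{r \sin\theta}. \tag{14.5} $$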


Where is this potential well-defined? We see that it diverges at $r = 0$, but we expected this,
because the magnetic field is divergent at $r = 0$. As $\theta \to 0$, the cosine in the numerator approaches
1 more quickly than the sine in the denominator approaches 0, so $A_\phi$ is well-defined at $\theta = 0$.
However, it blows up at $\theta = \pi$. Let us rename this vector potential, as

$$ A_\phi^+ = \frac{g (1 - \cos\theta)}{r \sin\theta}. \tag{14.6} $$

In calculating this vector potential, we made the arbitrary choice to consider the magnetic flux
through the upper cap bounded by $C$ instead of the lower cap. If we instead consider the flux
through the lower cap, we would find a vector potential $\mathbf{A}^- = A_\phi^-\, \hat{\mathbf{e}}_\phi$, where

$$ A_\phi^- = -\frac{g (1 + \cos\theta)}{r \sin\theta}. \tag{14.7} $$

This vector potential is not well-defined at $\theta = 0$, but is well-defined at $\theta = \pi$.

Thus, we cannot use $\mathbf{A}^+$ at the south pole, and we cannot use $\mathbf{A}^-$ at the north pole, but
everywhere else we could equally well choose either expression for the vector potential. Because
both of these vector potentials describe the same electromagnetic fields in these regions, they must
differ by the gradient of some function. Indeed, we see that

$$ \mathbf{A}^+ - \mathbf{A}^- = \left( A_\phi^+ - A_\phi^- \right) \hat{\mathbf{e}}_\phi = \frac{2g}{r \sin\theta}\, \hat{\mathbf{e}}_\phi = 2g \nabla\phi. \tag{14.8} $$


Thus, as expected, these two vector potentials differ from one another by a gauge transformation. 
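
The two patch potentials can be checked against the flux and against each other with a few lines of arithmetic. The sketch below is only an illustration; the function names are hypothetical, and the angle `theta` is the polar angle of the contour $C$.

```python
import math


def a_plus(g, r, theta):
    """A_phi^+ of Eq. (14.6): regular at theta = 0, singular at theta = pi."""
    return g * (1 - math.cos(theta)) / (r * math.sin(theta))


def a_minus(g, r, theta):
    """A_phi^- of Eq. (14.7): regular at theta = pi, singular at theta = 0."""
    return -g * (1 + math.cos(theta)) / (r * math.sin(theta))


def cap_flux(g, theta):
    """Flux of B = g r_hat / r^2 through the upper cap bounded by the circle at theta."""
    return 2 * math.pi * g * (1 - math.cos(theta))


def loop_integral(a_phi, r, theta):
    """Line integral of A_phi e_phi around the circle of radius r sin(theta)."""
    return 2 * math.pi * r * math.sin(theta) * a_phi
```

The loop integral of $\mathbf{A}^+$ reproduces the upper-cap flux, the loop integral of $\mathbf{A}^-$ differs from it by the total flux $4\pi g$, and the difference of the two potentials is $2g/(r\sin\theta)$, which is the $\hat{\mathbf{e}}_\phi$ component of $2g\nabla\phi$.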




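We have found that, in order to describe the magnetic monopole field in terms of vector potentials,
we can write $\mathbf{B} = \nabla \times \mathbf{A}$, where

$$ \mathbf{A} = \begin{cases} \mathbf{A}^+ = \dfrac{g (1 - \cos\theta)}{r \sin\theta}\, \hat{\mathbf{e}}_\phi, & \text{for } 0 \le \theta < \frac{\pi}{2} + \epsilon, \\[2ex] \mathbf{A}^- = -\dfrac{g (1 + \cos\theta)}{r \sin\theta}\, \hat{\mathbf{e}}_\phi, & \text{for } \frac{\pi}{2} - \epsilon < \theta \le \pi. \end{cases} \tag{14.9} $$

In the regions where $\mathbf{A}$ equals $\mathbf{A}^+$ or $\mathbf{A}^-$, these potentials are well-defined. In the overlap region, they differ
by a gauge transformation.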


14.1.1 QM of a Charged Particle Moving in a Magnetic Monopole Field 


Consider a particle of electric charge $e$. For $0 \le \theta < \frac{\pi}{2} + \epsilon$, let the wavefunction be $\psi^+(r, \theta, \phi)$, and
for $\frac{\pi}{2} - \epsilon < \theta \le \pi$, let the wavefunction be $\psi^-(r, \theta, \phi)$. Assume that we have already determined
these by solving the Schrödinger equation. In the overlap region, $\psi^+$ and $\psi^-$ must be related by a
gauge transformation. We saw in a previous lecture that if we make a gauge transformation

$$ \mathbf{A}^+ = \mathbf{A}^- + 2g \nabla\phi, \tag{14.10} $$

then the wavefunction must change by

$$ \psi^+ = \psi^-\, e^{2ieg\phi/\hbar c}. \tag{14.11} $$

Now, we note that the wavefunctions $\psi^+$ and $\psi^-$ must be single-valued when $\phi \to \phi + 2\pi$. This
requires that

$$ \frac{2eg}{\hbar c} = n \in \mathbb{Z}, \tag{14.12} $$

i.e.,

$$ g = \frac{n\hbar c}{2e}, \qquad n \in \mathbb{Z}. \tag{14.13} $$


This is the Dirac quantization condition: the magnetic monopole field strength must be quantized, 
with the quantum a function of the electric charge. 
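
The single-valuedness argument can be sketched numerically: the transition factor in Eq. (14.11) must return to itself after $\phi \to \phi + 2\pi$. The units $\hbar = c = 1$ and the helper names below are assumptions for the example.

```python
import cmath
import math

HBAR, C = 1.0, 1.0  # assumed units


def monopole_charge(n, e):
    """Magnetic charge allowed by Eq. (14.13) for integer n and electric charge e."""
    return n * HBAR * C / (2 * e)


def transition_phase(e, g, phi):
    """Factor relating psi^+ to psi^- in the overlap region, Eq. (14.11)."""
    return cmath.exp(2j * e * g * phi / (HBAR * C))


def is_single_valued(e, g, tol=1e-9):
    """True if the transition factor is unchanged under phi -> phi + 2 pi."""
    return abs(transition_phase(e, g, 2 * math.pi) - 1) < tol
```

Only the charges returned by `monopole_charge` for integer `n` pass the check; for example, half of the minimal charge gives a factor of $-1$ after one full turn.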

We can gain intuition about this result with a vague classical analogue. Note, however, that this 
quantization is a purely quantum effect, and so cannot be fully explained classically. Let’s consider 
the charged particle moving in the field of a magnetic monopole from a classical point of view.
Consider an electric monopole of strength $e$ and a magnetic monopole of strength $g$, both static, and
displaced from one another by distance $d$ along the $z$-axis. We can then ask about the total angular
momentum stored in the electromagnetic field; this information is contained in the Poynting vector,
which is proportional to $\mathbf{E} \times \mathbf{B}$. Purely by symmetry, we conclude that the angular momentum
must be directed along the $z$-axis. If we carry out the calculation, we find that the total angular
momentum is independent of the distance $d$, and is proportional to $eg$. In quantum mechanics, we
know that angular momentum is quantized. If we require the total angular momentum we found
to be an integer multiple of $\hbar/2$, then we recover the Dirac quantization condition. Incidentally, this
gives us a model of what is “spinning” in a spin-1/2 particle: a bound state of a bosonic magnetic
monopole and a bosonic electric monopole has spin-1/2. The naive statistics of this bound state
would be bosonic, because both the electric and magnetic monopoles are bosonic; however, the 
interactions between the two charges lead to a change in the statistics. 




14.2 Charged Particle in a Uniform Magnetic Field 


Consider a particle of electric charge $e$ moving in a uniform magnetic field $\mathbf{B} = B\hat{\mathbf{z}}$ in three
dimensions. The Hamiltonian is

$$ H = \frac{\left( \mathbf{p} - \frac{e}{c} \mathbf{A} \right)^2}{2m}. \tag{14.14} $$

Only $B_z \neq 0$, and so we can always choose $A_z = 0$ and $A_x$, $A_y$ independent of $z$. The Hamiltonian
then becomes

$$ H = \frac{p_z^2}{2m} + \frac{\Pi_x^2 + \Pi_y^2}{2m}, \tag{14.15} $$

where

$$ \Pi_{x,y} = p_{x,y} - \frac{e}{c} A_{x,y} \tag{14.16} $$

are the kinematic momenta in the $x$- and $y$-directions. Note that $[p_z, H] = 0$, so we can label the
eigenstates by $p_z$. We can then write the Hamiltonian in the form

$$ H = \frac{p_z^2}{2m} + H_{2d}, \tag{14.17} $$

with

$$ H_{2d} = \frac{\Pi_x^2 + \Pi_y^2}{2m}, \tag{14.18} $$

and we only have to determine the spectrum of $H_{2d}$.
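The trick is to notice that $\Pi_x$ and $\Pi_y$ have a simple commutation relation,

$$ [\Pi_x, \Pi_y] = \left[ -i\hbar\partial_x - \frac{e}{c} A_x,\; -i\hbar\partial_y - \frac{e}{c} A_y \right] = \frac{i\hbar e}{c} \left( \partial_x A_y - \partial_y A_x \right) = \frac{i\hbar e B}{c}. \tag{14.19} $$

Thus, these two kinematic momenta (appropriately rescaled) are canonically conjugate variables,
and the Hamiltonian $H_{2d}$ looks like the sum of squares of canonically conjugate variables, which is
the Hamiltonian of the simple harmonic oscillator in one dimension. More precisely, let

$$ X = \frac{c\, \Pi_x}{eB}, \qquad P = \Pi_y. \tag{14.20} $$

Then, $[X, P] = i\hbar$, and

$$ H_{2d} = \frac{P^2}{2m} + \frac{1}{2m} \left( \frac{eB}{c} \right)^2 X^2 = \frac{P^2}{2m} + \frac{1}{2} m \omega_c^2 X^2, \tag{14.21} $$

with

$$ \omega_c = \frac{eB}{mc}. \tag{14.22} $$

This is the one-dimensional SHO Hamiltonian, with frequency $\omega_c$, known as the cyclotron frequency.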

As an aside, the classical motion of a charged particle in a uniform magnetic field is described by
circular orbits in a plane orthogonal to the magnetic field. Matching the centrifugal force with the
force from the magnetic field, we have

$$ \frac{mv^2}{R} = \frac{evB}{c}, \tag{14.23} $$

which gives a radius of

$$ R = \frac{mvc}{eB}, \tag{14.24} $$

known as the cyclotron radius. The time period of the orbit is

$$ T = \frac{2\pi R}{v} = \frac{2\pi}{\omega_c}, \tag{14.25} $$

with $\omega_c$ the cyclotron frequency. This is the classical origin of the cyclotron frequency.
Now, we have the Hamiltonian

$$ H_{2d} = \frac{P^2}{2m} + \frac{1}{2} m \omega_c^2 X^2, \tag{14.26} $$

with $[X, P] = i\hbar$. We can immediately conclude that the energy levels are

$$ E_n^{(2d)} = \hbar\omega_c \left( n + \frac{1}{2} \right). \tag{14.27} $$

The three-dimensional energy levels of the full Hamiltonian $H$ are then

$$ E_n^{(3d)}(p_z) = \frac{p_z^2}{2m} + \hbar\omega_c \left( n + \frac{1}{2} \right). \tag{14.28} $$

However, we are not done, because we do not know the degeneracies of these energy levels. We will
find that the spectrum is highly degenerate.


14.2.1 Degeneracy


Why is there degeneracy in the spectrum? One way to understand the degeneracy is to notice that 
we can define new coordinates in the problem, 


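$$ R_x := x + \frac{c}{eB} \Pi_y, \qquad R_y := y - \frac{c}{eB} \Pi_x. \tag{14.29} $$

Note that $[R_x, R_y] = -\frac{c}{eB}\, i\hbar$. Thus, $R_x$ and $R_y$ are canonically conjugate up to a multiplicative
factor. Furthermore, we note that

$$ [R_x, \Pi_x] = [x, \Pi_x] + \frac{c}{eB} [\Pi_y, \Pi_x] = i\hbar + \frac{c}{eB} \left( -\frac{i\hbar e B}{c} \right) = 0. \tag{14.30} $$

Similarly, we find that $[R_y, \Pi_y] = 0$, and more generally,

$$ [R_i, \Pi_j] = 0. \tag{14.31} $$

Thus,

$$ [R_i, H] = 0. \tag{14.32} $$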


We have two operators that each commute with the Hamiltonian, but they do not commute with 
one another. Recall from a previous homework that when this is the case, the Hamiltonian must 
be degenerate. 

What is the physical meaning of these coordinates $R_i$? Recall that classically, the particle
undergoes circular motion in the presence of the uniform magnetic field. Classically, the vector
$\mathbf{R} = (R_x, R_y)$ is the center of the cyclotron orbit: if $(x, y)$ are the time-dependent coordinates
of the particle moving in its circular orbit, and we take $\Pi_i = m v_i$, then $(R_x, R_y)$ are the time-independent
coordinates of the center of the orbit. This point is called the guiding center. We
see that in quantum mechanics, the coordinates of the guiding center do not commute with one 
another in the presence of the magnetic field. The size of the cyclotron orbit will be fixed such that 
the magnetic flux through the orbit yields one of the quantized energies, but the location of the 
orbit is not fixed, which leads to the degeneracy. These degenerate energy levels are called Landau 
levels. 




Lecture 15 (Nov. 1, 2017) 


15.1 Charged Particle in a Uniform Magnetic Field 


Last time, we discussed the quantum mechanics of a charged particle moving in a uniform magnetic 
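field $\mathbf{B} = B\hat{\mathbf{z}}$. We decomposed the Hamiltonian as

$$ H = \frac{p_z^2}{2m} + H_{2d}, \tag{15.1} $$

with

$$ H_{2d} = \frac{\Pi_x^2 + \Pi_y^2}{2m}. \tag{15.2} $$

The Hamiltonian $H_{2d}$ has spectrum

$$ E_n^{(2d)} = \hbar\omega_c \left( n + \frac{1}{2} \right), \tag{15.3} $$

with the cyclotron frequency given by

$$ \omega_c = \frac{eB}{mc}. \tag{15.4} $$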


We then wanted to determine the degeneracy of these energy levels, called Landau levels. We 
defined the guiding center coordinates, 


$$ R_x = x + \frac{c}{eB} \Pi_y, \qquad R_y = y - \frac{c}{eB} \Pi_x. \tag{15.5} $$

These variables are conjugate, up to a multiplicative factor,

$$ [R_x, R_y] = -\frac{c}{eB}\, i\hbar = -i\ell_B^2, \tag{15.6} $$

and both commute with the Hamiltonian. As we showed on the homework, this implies that the
Hamiltonian is degenerate. Here, $\ell_B = \sqrt{\hbar c / eB}$ is the magnetic length, which is the length scale such that if
we cut out a circular disk of radius $\ell_B$ orthogonal to the magnetic field, then the flux through the
disk is on the order of one flux quantum:

$$ \pi B \ell_B^2 = \frac{\pi\hbar c}{e} \sim \frac{hc}{e} = \Phi_0, \tag{15.7} $$

where $\Phi_0$ is the flux quantum.

The guiding center coordinates are the coordinates of the center of the cyclotron orbit, which 
are a constant of the motion. This statement is equivalent to the statement that the guiding center 
coordinates commute with the Hamiltonian. In the quantum theory, the size of the orbit is set by 
the only length scale in the problem, the magnetic length. We expect that the degeneracy is the 
number of flux quanta passing through the sample, $\sim BA/\Phi_0$, where $A$ is the area of the sample.

Now we will calculate the degeneracy explicitly. We pick a particular gauge to work in, for 
convenience. We choose 


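$$ A_y = Bx, \qquad A_x = 0. \tag{15.8} $$

This is known as Landau gauge. In this gauge,

$$ H_{2d} = \frac{p_x^2}{2m} + \frac{\left( p_y - \frac{eBx}{c} \right)^2}{2m} = \frac{p_x^2}{2m} + \frac{1}{2} m \omega_c^2 \left( x - \frac{c\, p_y}{eB} \right)^2. \tag{15.9} $$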




In this gauge, $[p_y, H] = 0$, so we can label states by their $p_y$ eigenvalue. For any fixed $p_y$, we get a
one-dimensional simple harmonic oscillator of frequency $\omega_c$, which reproduces the spectrum

$$ E_n^{(2d)} = \hbar\omega_c \left( n + \frac{1}{2} \right). \tag{15.10} $$

The energy eigenfunctions take the form

$$ \psi_{n, p_y}(x, y) = e^{i p_y y/\hbar}\, \phi_n\!\left( x - \frac{c\, p_y}{eB} \right). \tag{15.11} $$

This is a plane wave multiplied by a shifted SHO wavefunction. It is now clear that states with
different values of $p_y$ will be degenerate. The degeneracy question then becomes the question of
how many distinct values of $p_y$ exist for a system of a given area.

Consider a sample of lengths $L_x$ and $L_y$ along the $x$- and $y$-directions, respectively, with periodic
boundary conditions along the $y$-direction. We then must have

$$ e^{i p_y L_y/\hbar} = 1, \tag{15.12} $$

which gives us quantized momentum modes along the $y$-direction,

$$ p_y = \frac{2\pi m \hbar}{L_y}, \qquad m \in \mathbb{Z}. \tag{15.13} $$

Each of the degenerate wavefunctions $\psi_{n, p_y}(x, y)$ has a modulus $\left| \phi_n\!\left( x - \frac{c\, p_y}{eB} \right) \right|$ and is centered at

$$ x_0 = \frac{c\, p_y}{eB} = \frac{2\pi\hbar c}{eB L_y}\, m. \tag{15.14} $$

However, we cannot arbitrarily shift these SHO wavefunctions, because the sample
has a finite length $L_x$ in the $x$-direction. Thus, the $y$-direction momentum is quantized because of
the finite length in the $y$-direction, and has a finite number of possible values because of the finite
length in the $x$-direction. The number of independent states is then

$$ N = \frac{L_x}{\left( \frac{2\pi\hbar c}{eB L_y} \right)} = \frac{eB L_x L_y}{2\pi\hbar c} = \frac{BA}{hc/e} = \frac{BA}{\Phi_0}, \tag{15.15} $$


as we expected. 
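
To get a feel for the scales involved, the sketch below evaluates the cyclotron frequency, magnetic length, flux quantum, and Landau-level degeneracy in cgs (Gaussian) units. The constants are rounded values for an electron, the choice of the electron is an assumption made for illustration, and the function names are hypothetical.

```python
import math

# Rounded cgs (Gaussian) constants; assumed values for illustration.
HBAR = 1.054571817e-27  # erg s
C = 2.99792458e10       # cm / s
E_CHARGE = 4.80320471e-10  # esu (electron charge magnitude)
M_ELECTRON = 9.1093837e-28  # g


def cyclotron_frequency(b_gauss, mass=M_ELECTRON, charge=E_CHARGE):
    """omega_c = e B / (m c), Eq. (15.4), in rad/s."""
    return charge * b_gauss / (mass * C)


def magnetic_length(b_gauss, charge=E_CHARGE):
    """l_B = sqrt(hbar c / (e B)), in cm."""
    return math.sqrt(HBAR * C / (charge * b_gauss))


def flux_quantum(charge=E_CHARGE):
    """Phi_0 = h c / e, in gauss cm^2."""
    return 2 * math.pi * HBAR * C / charge


def landau_degeneracy(b_gauss, area_cm2, charge=E_CHARGE):
    """N = B A / Phi_0, Eq. (15.15)."""
    return b_gauss * area_cm2 / flux_quantum(charge)


def landau_energy(n, b_gauss, mass=M_ELECTRON, charge=E_CHARGE):
    """E_n = hbar omega_c (n + 1/2), Eq. (15.10), in erg."""
    return HBAR * cyclotron_frequency(b_gauss, mass, charge) * (n + 0.5)
```

For example, `landau_degeneracy(1e4, 1.0)` gives the number of states per Landau level in a 1 cm² sample at $10^4$ gauss (1 tesla), which equals $A/(2\pi\ell_B^2)$.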
In classical mechanics, we can prove that there is no diamagnetism. In classical statistical 
mechanics, everything is determined by the partition function, 


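$$ Z \propto \int \prod_i d^3x_i\, d^3p_i\; e^{-\beta H(\{\mathbf{x}_i\}, \{\mathbf{p}_i\})}. \tag{15.16} $$

In the presence of a magnetic field, the Hamiltonian changes,

$$ Z[\mathbf{A}] \propto \int \prod_i d^3x_i\, d^3p_i\; \exp\!\left[ -\beta \left( \sum_i \frac{\left( \mathbf{p}_i - \frac{e}{c} \mathbf{A}(\mathbf{x}_i) \right)^2}{2m} + V(\{\mathbf{x}_i\}) \right) \right]. \tag{15.17} $$

However, we can simply shift the momenta as $\mathbf{p}_i' = \mathbf{p}_i - \frac{e}{c} \mathbf{A}(\mathbf{x}_i)$, which then gives us $Z[\mathbf{A}] = Z[\mathbf{A} = 0]$.
Thus, classically, the magnetic field does not affect the statistical mechanics. However, we have
just seen that in quantum mechanics, there is an effect. This effect is purely quantum mechanical.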
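
The shift argument can be illustrated with a one-dimensional momentum integral: the Gaussian integral over $p$ does not depend on where it is centered. In the sketch below, `shift` plays the role of $\frac{e}{c}A(x)$ at a fixed position; the units, cutoff, and function name are assumptions for the example.

```python
import math


def momentum_integral(shift, beta=1.0, mass=1.0, p_max=40.0, n=20_000):
    """Trapezoid estimate of the integral over p of exp(-beta (p - shift)^2 / (2 m))."""
    h = 2 * p_max / n
    total = 0.0
    for k in range(n + 1):
        p = -p_max + k * h
        weight = 0.5 if k in (0, n) else 1.0
        total += weight * math.exp(-beta * (p - shift) ** 2 / (2 * mass))
    return total * h
```

Every value of `shift` gives the same result, $\sqrt{2\pi m/\beta}$, so the position integrand of the classical partition function is unchanged by the vector potential.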




We might wonder how a two-dimensional problem of motion in the x, y-plane turned into a 
one-dimensional SHO. The electron is localized around the guiding center coordinates. However, 
we see that the two coordinates do not commute with one another. Thus, there is only one real 
coordinate in the problem, and the other coordinate is its conjugate variable. This is why we are 
only solving a one-dimensional problem at the end of the day. Note that because $R_x$ and $R_y$ do
not commute, we cannot be in a simultaneous eigenstate of both, so the location of the guiding center
itself must be smeared out. 


15.2 Composite Systems


We now consider composite systems, which are systems composed of two (or more) subsystems.
Suppose we have two systems $A$ and $B$, with Hilbert spaces $\mathcal{H}_A$ and $\mathcal{H}_B$, respectively. We want to
obtain the Hilbert space of the composite system $A + B$, which is called the tensor product of $\mathcal{H}_A$
and $\mathcal{H}_B$.

Consider a basis $\{|\phi_i\rangle\}$ of $\mathcal{H}_A$ and a basis $\{|\chi_j\rangle\}$ of $\mathcal{H}_B$. We will define new vectors $|\phi_i\rangle \otimes |\chi_j\rangle$
that live in a new Hilbert space, denoted $\mathcal{H}_{A+B} = \mathcal{H}_A \otimes \mathcal{H}_B$. These $|\phi_i\rangle \otimes |\chi_j\rangle$ will provide a basis
for the Hilbert space $\mathcal{H}_{A+B}$, meaning that any state in $\mathcal{H}_{A+B}$ can be decomposed as

$$ |\psi_{A+B}\rangle = \sum_{i,j} \alpha_{ij}\, |\phi_i\rangle \otimes |\chi_j\rangle. \tag{15.18} $$

Oftentimes, we will omit the tensor product symbol $\otimes$ for convenience, when the meaning is clear.
Thus, we will write $|\phi_i\rangle |\chi_j\rangle = |\phi_i\rangle \otimes |\chi_j\rangle$.

We can then write down operators on $\mathcal{H}_{A+B}$ and formulate the quantum theory as we have
before. There are classes of operators that act on only one of the two subsystems, such as
$O = O_A \otimes 1_B$, but in general we can have operators that act nontrivially on both subsystems at once.
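
In a finite-dimensional basis the tensor product becomes the Kronecker product of coefficient arrays. The sketch below uses plain Python lists; the helper names and the choice of a two-level example are assumptions for illustration.

```python
def kron_vec(u, v):
    """Coefficients of |u> ⊗ |v> in the product basis |i> ⊗ |j>, ordered as (i, j)."""
    return [a * b for a in u for b in v]


def kron_mat(a, b):
    """Matrix of the operator a ⊗ b in the same product basis."""
    return [[a_ij * b_kl for a_ij in row_a for b_kl in row_b]
            for row_a in a for row_b in b]


def apply(matrix, vector):
    """Matrix-vector product."""
    return [sum(m * x for m, x in zip(row, vector)) for row in matrix]


# Two-level example (assumed for illustration).
UP, DOWN = [1, 0], [0, 1]
SIGMA_X = [[0, 1], [1, 0]]
IDENTITY = [[1, 0], [0, 1]]
```

An operator of the form $O_A \otimes 1_B$ acts only on the first factor: `apply(kron_mat(SIGMA_X, IDENTITY), kron_vec(UP, DOWN))` equals `kron_vec(apply(SIGMA_X, UP), DOWN)`.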

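Consider, in particular, a system of two spin-1/2 particles $A$ and $B$. Some possible states in the
composite Hilbert space can be written as a single tensor product, such as

$$ |{\uparrow_A}\rangle \otimes |{\uparrow_B}\rangle, \qquad |{\downarrow_A}\rangle \otimes |{\uparrow_B}\rangle, \qquad |{\uparrow_A}\rangle \otimes \left[ \frac{1}{\sqrt{2}} \left( |{\uparrow_B}\rangle + |{\downarrow_B}\rangle \right) \right]. \tag{15.19} $$

There are clearly an infinite number of such states. There are also more interesting states (entangled
states) that cannot be written as a single tensor product, such as

$$ \frac{1}{\sqrt{2}} \left( |{\uparrow_A}\rangle \otimes |{\downarrow_B}\rangle - |{\downarrow_A}\rangle \otimes |{\uparrow_B}\rangle \right). \tag{15.20} $$

Some of the operators on this Hilbert space act only on one of the subsystems, such as
$\sigma_A^i \otimes 1_B := \sigma_A^i$ (the second way of writing this is a notational convenience). However, there are more general
operators that act nontrivially on both subsystems simultaneously, such as $\sigma_A^i \otimes \sigma_B^j := \sigma_A^i \sigma_B^j$.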


15.2.1 Quantum Entanglement 


Two subsystems $A$ and $B$ of a quantum mechanical system can be entangled with one another. For
example, consider again the system of two spin-1/2 particles $A$ and $B$. Some states, such as

$$ |{\uparrow_A}\rangle \otimes |{\downarrow_B}\rangle, \qquad |{\downarrow_A}\rangle \otimes |{\downarrow_B}\rangle, \tag{15.21} $$

have the property that if we restrict our view to only one of the subsystems, then the system is in
a well-defined state of that subsystem. By contrast, if we consider a state such as

$$ \frac{1}{\sqrt{2}} \left( |{\uparrow_A}\rangle \otimes |{\downarrow_B}\rangle - |{\downarrow_A}\rangle \otimes |{\uparrow_B}\rangle \right), \tag{15.22} $$

then we see that neither spin by itself is in a definite quantum state, despite the fact that the
composite system is in a definite state. This is an entangled state.

Entanglement is a statement about the relation of subsystems of a system to one another. It 
does not make sense to ask if the state of a system “is entangled.” Instead, we can make statements 
about whether two subsystems are entangled with one another. If we have subsystems A and B of 
some composite system, then an unentangled state is precisely one that can be factored as a tensor 
product of states of the two subsystems, 


\[ |\psi_{A+B}\rangle = |\psi_A\rangle \otimes |\psi_B\rangle . \tag{15.23} \]


An entangled state is one that cannot be factorized in this way. 




Lecture 18 (Nov. 13, 2017) 


18.1 Symmetries in Quantum Mechanics 


A symmetry is a physical operation we can perform on the system that leaves the physics unchanged. 
As an example, consider a free particle,

\[ H_{\text{free}} = \frac{p^2}{2m} . \tag{18.1} \]

This Hamiltonian does not depend on position, so we can translate $x \to x + a$. Under this transformation,
$H_{\text{free}}$ is unchanged and $[x + a, p] = i\hbar$, implying that both $H$ and the commutation algebra
are unchanged. Thus, translation is a symmetry of the free-particle system. This Hamiltonian has
several other symmetries. Inversion (or parity), given by $x \to -x$, $p \to -p$, preserves $[x, p] = i\hbar$ and
$H_{\text{free}}$, and so is a symmetry. Time reversal, which sends $t \to -t$, is also a symmetry; we will discuss
its realization in quantum mechanics later. The system is also invariant under Galilean transformations,
even though Galilean boosts $p \to p + m v_0$ seem to change the Hamiltonian. Galilean
boosts change the phase in the path integral by something that does not depend on the path, so
the probabilities do not change, but the amplitudes do; this example is more subtle, and so we will
stick to discussions of the other symmetries of the system.


18.1.1 Symmetry Transformations 


We define a symmetry operation as a transformation $U\colon |a\rangle \to |a'\rangle$ such that all results of
measurement are preserved. This means that

\[ |\langle b'|a'\rangle|^2 = |\langle b|a\rangle|^2 \tag{18.2} \]

for all $|a\rangle, |b\rangle \in \mathcal{H}$. From the definition of the primed kets, this gives

\[ |\langle b|U^\dagger U|a\rangle|^2 = |\langle b|a\rangle|^2 . \tag{18.3} \]

Wigner’s theorem tells us that if $U$ satisfies this condition, then $U$ must be unitary or anti-unitary;
an anti-unitary operator $\tilde{U}$ is one that satisfies

\[ \tilde{U}|a\rangle = K U|a\rangle = (U|a\rangle)^* , \tag{18.4} \]

where $K$ is the complex conjugation operator, and $U$ is some unitary operator. If we assume that
$U$ is linear or anti-linear, meaning that

\[ U\Bigl(\sum_a c_a |a\rangle\Bigr) = \sum_a c_a^{(*)}\, U|a\rangle \tag{18.5} \]

(with the complex conjugate taken in the anti-linear case), then we can show Wigner’s theorem fairly simply (this will appear on your homework). Wigner’s
theorem can be proved without making this assumption, but the proof is more subtle.

18.1.2 Continuous Symmetries and Conservation Laws 


Consider a symmetry operation described by a unitary operator $U$. This is a symmetry of the
Hamiltonian if $H$ is unchanged by the action of $U$, i.e.,

\[ H = U^\dagger H U , \tag{18.6} \]

or equivalently $[H, U] = 0$. If the symmetry transformation can be continuously built up as a series of
infinitesimal transformations starting from the identity operator 1, then we call it a continuous 
symmetry. Translation and rotation are examples of continuous symmetries: any translation or 
rotation can be built up from the identity by doing a sequence of infinitesimal translations or 
rotations. Symmetries that cannot be built up in this way are called discrete symmetries. Parity 
is an example of a discrete symmetry (in even dimensions, we can construct parity as a rotation, 
but it is nonetheless possible that the Hamiltonian is symmetric under parity but not rotations, so 
parity is still discrete). 

Let us now consider continuous symmetries. All of the information about a continuous symmetry 
is contained in its infinitesimal form, because we can always build up a continuous transformation 
from its infinitesimal version. As we have seen previously, an infinitesimal unitary operator can be 
expressed in terms of a Hermitian operator: if we expand an infinitesimal unitary operator U as 


\[ U = 1 - \frac{i\epsilon}{\hbar} G + O(\epsilon^2) , \tag{18.7} \]

then the statement $U^\dagger U = 1$ implies $G^\dagger = G$, i.e., $G$ is Hermitian. The statement that $U$ is a
symmetry is $U^\dagger H U = H$, i.e.,

\[ \Bigl(1 + \frac{i\epsilon}{\hbar} G + O(\epsilon^2)\Bigr) H \Bigl(1 - \frac{i\epsilon}{\hbar} G + O(\epsilon^2)\Bigr) = H , \tag{18.8} \]

which gives us $[G, H] = 0$. Because $G$ is Hermitian, it corresponds to some observable, and we now
have

\[ \frac{dG}{dt} = \frac{1}{i\hbar}[G, H] = 0 , \tag{18.9} \]

so we see that $G$ is conserved. This leads us to the following general statement:

Theorem 4 (Noether’s Theorem). For every continuous symmetry of the Hamiltonian in quantum
mechanics, there is a corresponding conserved quantity. Conversely, if some observable $G$ is
conserved, then $[G, H] = 0$, and we can define unitary operators

\[ U(\theta) = e^{-i\theta G/\hbar} , \tag{18.10} \]

which will satisfy $U(\theta)^\dagger H U(\theta) = H$, showing that $U(\theta)$ is a continuous symmetry of the Hamiltonian.


18.1.3 Translations 
As an example, consider again translation, $x \to x + a$. The translation operator is

\[ T(a) = e^{-ipa/\hbar} . \tag{18.11} \]

This satisfies $(T(a))^\dagger = (T(a))^{-1}$, i.e., the translation operator is unitary. It has the following
properties:

• $(T(a))^\dagger = T(-a)$,
• $T(a')\,T(a'') = T(a' + a'')$,
• $T^\dagger(a)\, x\, T(a) = x + a$.

For infinitesimal $a$, we can expand the translation operator as

\[ T(a) \approx 1 - \frac{ia}{\hbar}\, p , \tag{18.12} \]

where $p$ is the Hermitian generator of $T(a)$. If a system has translation symmetry, then momentum
is conserved, from Noether’s theorem. In $d$ dimensions, we generalize to translations $x_i \to x_i + a_i$,
with $i = 1, \ldots, d$. Then, the unitary translation operator is

\[ T(\{a_i\}) = \prod_i T_i(a_i) = \prod_i e^{-ip_i a_i/\hbar} . \tag{18.13} \]

It is a geometric fact that translations in different directions commute, $[T_i(a_i), T_j(a_j)] = 0$, for all
$i, j$. This tells us that the Hermitian generators commute, $[p_i, p_j] = 0$, for all $i, j$. We can define
momentum as the observable that corresponds to the Hermitian generator of translations; then
taking $a$ to be infinitesimal in the identity

\[ e^{ipa/\hbar}\, x\, e^{-ipa/\hbar} = x + a \tag{18.14} \]

gives us the commutation relation $[x, p] = i\hbar$.
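
The action of the translation operator on a wavefunction can be checked by hand. In the position representation $p = -i\hbar\, d/dx$, so $T(a) = \exp(-a\, d/dx)$ and one expects $(T(a)\psi)(x) = \psi(x - a)$. The following is a minimal Python sketch of that statement, assuming a polynomial $\psi$ so that the exponential series terminates; the function names are illustrative and not part of the notes.

```python
from math import factorial

def derivative(coeffs):
    """Differentiate a polynomial stored as [c0, c1, c2, ...] (c_k multiplies x**k)."""
    return [k * c for k, c in enumerate(coeffs)][1:]

def evaluate(coeffs, x):
    """Evaluate the polynomial at the point x."""
    return sum(c * x**k for k, c in enumerate(coeffs))

def translate(coeffs, a):
    """Apply T(a) = exp(-a d/dx) as a power series; it terminates for polynomials."""
    result = [0.0] * len(coeffs)
    term, n = list(coeffs), 0
    while term:
        weight = (-a) ** n / factorial(n)
        for k, c in enumerate(term):
            result[k] += weight * c
        term, n = derivative(term), n + 1
    return result

psi = [1.0, 2.0, 3.0]           # psi(x) = 1 + 2x + 3x^2 (an arbitrary test function)
shifted = translate(psi, 0.5)   # coefficients of (T(a) psi)(x) for a = 0.5
```

Evaluating `shifted` at $x$ and `psi` at $x - a$ should agree, which is the wavefunction version of $T^\dagger(a)\, x\, T(a) = x + a$: the probability density moves to the right by $a$.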


18.1.4 Time Translations 


For a closed system, time translation is a symmetry, and the corresponding unitary operator is the
time-evolution operator,

\[ U(t) = e^{-iHt/\hbar} . \tag{18.15} \]

The Hamiltonian $H$ is the generator of this symmetry, and so the fact that the system is invariant
under time translation implies the conservation of energy,

\[ \frac{dH}{dt} = 0 . \tag{18.16} \]


18.1.5 Rotations 


If we have a $d$-dimensional system with position coordinate $x_i$, then a rotation is given by

\[ x_i \to x_i' = \sum_{j=1}^{d} R_{ij} x_j = R_{ij} x_j , \tag{18.17} \]

such that the scalar product is preserved,

\[ \mathbf{x} \cdot \mathbf{y} = \mathbf{x}' \cdot \mathbf{y}' . \tag{18.18} \]

Here we are using the Einstein summation convention that repeated indices are summed over. The
preservation of the scalar product can be written as

\[ R_{ij} R_{ik}\, x_j y_k = x_i y_i , \tag{18.19} \]

which implies

\[ R_{ij} R_{ik} = \delta_{jk} , \tag{18.20} \]

i.e., $R^T R = 1$ (equivalently, $R R^T = 1$). That is, $R$ is an orthogonal matrix.

The orthogonality condition, $R R^T = 1$, implies that $(\det R)^2 = 1$, meaning that $\det R = \pm 1$.
We see then that there are two classes of rotations. Matrices with $\det R = -1$ are not continuously
connected to the identity; they involve inversions $x_i \to -x_i$ combined with an ordinary (orientation-preserving)
rotation. (Note that in even dimensions, parity is orientation-preserving.) For now,
we will stick to considering rotations $R$ that have $\det R = +1$. Each rotation $R$ is represented in
quantum mechanics by a unitary operator $D(R)$ that acts on the system’s Hilbert space.




Lecture 19 (Nov. 15, 2017) 


19.1 Rotations 


Recall that rotations are transformations of the form $x_i \to R_{ij} x_j$ (using Einstein summation notation),
where $R$ is an orthogonal matrix, $R^T R = 1$. This $R$ is called a rotation matrix. For now, we
will restrict to rotations $R$ with $\det R = +1$ (orientation-preserving or proper rotations).
Every rotation $R$ of space corresponds to a unitary operator $D(R)$ on the Hilbert space, which
satisfies

\[ D(R_1) D(R_2) = D(R_1 R_2) . \tag{19.1} \]

We will discuss this composition property in more detail later. A quantum state transforms as $|\alpha\rangle \to |\alpha_R\rangle$
under this rotation, such that

\[ |\alpha_R\rangle = D(R)|\alpha\rangle . \tag{19.2} \]

For a vector operator $V_i$, with $i = 1, \ldots, d$, we require

\[ \langle\beta_R|V_i|\alpha_R\rangle = R_{ij}\langle\beta|V_j|\alpha\rangle . \tag{19.3} \]

Expanding the transformed bra and ket, this is

\[ \langle\beta|D^\dagger(R)\, V_i\, D(R)|\alpha\rangle = R_{ij}\langle\beta|V_j|\alpha\rangle . \tag{19.4} \]

This is true for any states $|\alpha\rangle, |\beta\rangle$, which implies that the operators must be equal:

\[ D^\dagger(R)\, V_i\, D(R) = R_{ij} V_j \tag{19.5} \]

holds as an operator equation.
Consider the infinitesimal rotation $R = 1 - \omega$. The orthogonality condition, $R^T R = 1$, then
implies that $\omega^T = -\omega$. Thus, $\omega$ is a real, antisymmetric matrix. We can then expand $D(R)$ in the
form

\[ D(R) = 1 - \frac{i}{2\hbar}\,\omega_{ij} J_{ij} + O(\omega^2) . \tag{19.6} \]

This expansion identifies the objects $J_{ij} = -J_{ji}$ as the Hermitian generators of rotations. Note
that the antisymmetry of $J_{ij}$ follows from the antisymmetry of $\omega_{ij}$.
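Let us now specialize to three dimensions, $d = 3$. In this case, we can write

\[ D(R) = 1 - \frac{i}{\hbar}\,(J_{12}\omega_{12} + J_{23}\omega_{23} + J_{31}\omega_{31}) + O(\omega^2) , \tag{19.7} \]

where we have used the antisymmetry of $J_{ij}$ to group terms. We define

\[ J_1 := J_{23} , \qquad J_2 := J_{31} , \qquad J_3 := J_{12} , \tag{19.8} \]

i.e.,

\[ J_i = \frac{1}{2}\,\epsilon_{ijk} J_{jk} , \tag{19.9} \]

where $\epsilon_{ijk}$ is the totally antisymmetric symbol with $\epsilon_{123} = +1$, known as the Levi-Civita symbol.
We can similarly define

\[ \theta_1 := \omega_{23} , \qquad \theta_2 := \omega_{31} , \qquad \theta_3 := \omega_{12} , \tag{19.10} \]

i.e.,

\[ \theta_i = \frac{1}{2}\,\epsilon_{ijk}\,\omega_{jk} , \qquad \omega_{ij} = \epsilon_{ijk}\,\theta_k . \tag{19.11} \]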



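Then, we have

\[ D(R) = 1 - \frac{i}{\hbar}\,\boldsymbol{\theta}\cdot\mathbf{J} + O(\theta^2) . \tag{19.12} \]

Note that

\[ R_{ij} = \delta_{ij} - \epsilon_{ijk}\,\theta_k + O(\theta^2) . \tag{19.13} \]

Thus,

\[ x_i \to x_i' = R_{ij} x_j = (\delta_{ij} - \epsilon_{ijk}\,\theta_k)\, x_j = x_i + \epsilon_{ikj}\,\theta_k x_j , \tag{19.14} \]

i.e.,

\[ \mathbf{x} \to \mathbf{x}' = \mathbf{x} + \boldsymbol{\theta} \times \mathbf{x} . \tag{19.15} \]

Thus, the meaning of $\boldsymbol{\theta}$ is that $\mathbf{x}$ is rotated by an angle $|\boldsymbol{\theta}|$ about the $\hat{\boldsymbol{\theta}}$-direction.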
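
The index gymnastics relating the antisymmetric matrix $\omega_{ij} = \epsilon_{ijk}\theta_k$ to the cross product is easy to get wrong by a sign, so a quick numerical check may help. The sketch below is a hedged illustration in plain Python (zero-based indices, helper names invented here): it builds $R_{ij} = \delta_{ij} - \epsilon_{ijk}\theta_k$ and compares $R\mathbf{x}$ with $\mathbf{x} + \boldsymbol{\theta}\times\mathbf{x}$.

```python
def levi_civita(i, j, k):
    """Totally antisymmetric symbol for indices 0, 1, 2, with eps(0, 1, 2) = +1."""
    return (i - j) * (j - k) * (k - i) // 2

def infinitesimal_rotation(theta):
    """R_ij = delta_ij - eps_ijk theta_k, dropping terms of order theta^2."""
    return [[(1.0 if i == j else 0.0)
             - sum(levi_civita(i, j, k) * theta[k] for k in range(3))
             for j in range(3)] for i in range(3)]

def apply(R, x):
    """Matrix-vector product R x."""
    return [sum(R[i][j] * x[j] for j in range(3)) for i in range(3)]

def cross(a, b):
    """Ordinary three-dimensional cross product a x b."""
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]

theta = [1e-3, -2e-3, 3e-3]   # a small rotation vector (arbitrary test values)
x = [1.0, 2.0, 3.0]
rotated = apply(infinitesimal_rotation(theta), x)
expected = [xi + ci for xi, ci in zip(x, cross(theta, x))]
```

The two vectors `rotated` and `expected` should agree up to rounding, while $R^T R$ differs from the identity only at order $\theta^2$.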
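We now define $J_k$ to be the components of the angular momentum. First, we will derive the
commutation relations of $J_k$ with any vector operator $V_i$. We start with the equation

\[ D^\dagger(R)\, V_i\, D(R) = R_{ij} V_j \tag{19.16} \]

and take $R = 1 - \omega$ with $\omega$ infinitesimal. The left-hand side then becomes

\[ \Bigl(1 + \frac{i}{\hbar}\,\theta_k J_k\Bigr) V_i \Bigl(1 - \frac{i}{\hbar}\,\theta_k J_k\Bigr) = V_i + \frac{i}{\hbar}\,\theta_k [J_k, V_i] + O(\theta^2) , \tag{19.17} \]

while the right-hand side becomes

\[ R_{ij} V_j = V_i - \epsilon_{ijk}\,\theta_k V_j . \tag{19.18} \]

Thus, we conclude that

\[ [J_k, V_i] = i\hbar\,\epsilon_{ijk} V_j . \tag{19.19} \]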


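We can then use a combination of rotations to deduce the angular momentum algebra, via

\[ D(R_1) D(R_2) = D(R_1 R_2) . \tag{19.20} \]

In particular, this composition rule implies that

\[ D(R_\phi)\, D(R_\theta)\, D(R_\phi^{-1}) = D(R_\phi R_\theta R_\phi^{-1}) . \tag{19.21} \]

The rotation $R_\phi R_\theta R_\phi^{-1}$ can be written as a single rotation $R_{\theta'}$ for some $\boldsymbol{\theta}'$. As $\boldsymbol{\theta}$ itself is a vector,
for $\boldsymbol{\phi}$ infinitesimal, we have

\[ \boldsymbol{\theta}' = \boldsymbol{\theta} + \boldsymbol{\phi} \times \boldsymbol{\theta} . \tag{19.22} \]

If we take $\boldsymbol{\theta}$ to be infinitesimal, we have

\[ D(R_\theta) = 1 - \frac{i}{\hbar}\,\theta_k J_k + O(\theta^2) , \tag{19.23} \]

so (19.21) becomes

\[ \theta_k\, D(R_\phi)\, J_k\, D(R_\phi^{-1}) = \theta'_k J_k . \tag{19.24} \]

The left-hand side of this equation, for infinitesimal $\boldsymbol{\phi}$, is

\[ \theta_k \Bigl(1 - \frac{i}{\hbar}\,\phi_j J_j\Bigr) J_k \Bigl(1 + \frac{i}{\hbar}\,\phi_j J_j\Bigr) = \theta_k J_k - \frac{i}{\hbar}\,\phi_j \theta_k [J_j, J_k] + \cdots , \tag{19.25} \]

while the right-hand side is

\[ \theta_\ell J_\ell + \epsilon_{jk\ell}\,\phi_j \theta_k J_\ell + \cdots , \tag{19.26} \]

which leads us to conclude that

\[ [J_i, J_j] = i\hbar\,\epsilon_{ijk} J_k . \tag{19.27} \]

This is the angular momentum commutation algebra. Note that this matches the commutation
relation for a vector operator with the angular momentum operator, so this shows that the angular
momentum operator is a vector.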

In general, we can write the angular momentum as

\[ \mathbf{J} = \mathbf{L} + \mathbf{S} , \tag{19.28} \]

where $\mathbf{L} = \mathbf{x} \times \mathbf{p}$ is the orbital angular momentum and $\mathbf{S}$ is an internal property that commutes with
$\mathbf{x}$, $\mathbf{p}$, etc. We can check that $\mathbf{L}$ on its own satisfies the angular momentum commutation algebra,
so the operator $\mathbf{J}$ will satisfy the angular momentum algebra if $\mathbf{S}$ does. The operator $\mathbf{S}$ is the spin
operator.

If the Hamiltonian is rotationally invariant, then $[J_i, H] = 0$, which implies that

\[ \frac{dJ_i}{dt} = 0 , \tag{19.29} \]

and so angular momentum is conserved.


19.1.1 Eigensystem of Angular Momentum 
Let us now understand the implications of the commutation algebra

\[ [J_i, J_j] = i\hbar\,\epsilon_{ijk} J_k . \tag{19.30} \]

You will show on the homework that

\[ [J^2, J_i] = 0 . \tag{19.31} \]

This means that we can diagonalize $J^2$ and one component of the angular momentum, say $J_z$,
simultaneously. We can then label the simultaneous eigenstates of $J^2$ and $J_z$ by $|j,m\rangle$, with

\[ J^2|j,m\rangle = a|j,m\rangle , \qquad J_z|j,m\rangle = b|j,m\rangle , \tag{19.32} \]

for some eigenvalues $a, b$. The meanings of the values $j$ and $m$ will become apparent shortly.
It is useful to define the ladder operators

\[ J_\pm = J_x \pm iJ_y , \tag{19.33} \]

which satisfy

\[ [J_+, J_-] = 2\hbar J_z , \qquad [J_z, J_\pm] = \pm\hbar J_\pm , \qquad [J^2, J_\pm] = 0 . \tag{19.34} \]

Using these commutation relations, we see that

\[ J_z\bigl(J_\pm|j,m\rangle\bigr) = (J_\pm J_z \pm \hbar J_\pm)|j,m\rangle = (b \pm \hbar)\bigl(J_\pm|j,m\rangle\bigr) . \tag{19.35} \]

Thus, $J_\pm|j,m\rangle$ is also an eigenstate of $J_z$, with eigenvalue $b \pm \hbar$.


We can write

\[ J^2 = J_x^2 + J_y^2 + J_z^2 = J_z^2 + \frac{1}{2}(J_+ J_- + J_- J_+) = J_z^2 + \frac{1}{2}\bigl(J_+ J_+^\dagger + J_+^\dagger J_+\bigr) , \tag{19.36} \]

which tells us that $J^2 - J_z^2$ is positive semi-definite,

\[ \langle j,m|J^2 - J_z^2|j,m\rangle \geq 0 . \tag{19.37} \]

This implies that $a - b^2 \geq 0$ for all eigenstates. For a fixed $a$, this means that $|b|$ has a maximum
value $b_{\max}$. This seems to be in conflict with the statement that we can use $J_\pm$ to raise or lower
the eigenvalue arbitrarily. We conclude that at $b = +b_{\max}$ the state must be annihilated by $J_+$,
and similarly, at $b = -b_{\max}$ the state must be annihilated by $J_-$.
Call $|\max\rangle$ the state with $b = +b_{\max}$. Then, we have

\[ J_+|\max\rangle = 0 , \tag{19.38} \]

which implies

\[ J_- J_+|\max\rangle = 0 . \tag{19.39} \]

Expanding the ladder operators, this becomes

\[ (J_x - iJ_y)(J_x + iJ_y)|\max\rangle = (J^2 - J_z^2 - \hbar J_z)|\max\rangle = 0 . \tag{19.40} \]

This gives us

\[ a - b_{\max}^2 - \hbar b_{\max} = 0 , \tag{19.41} \]

i.e.,

\[ a = b_{\max}(b_{\max} + \hbar) . \tag{19.42} \]

Repeating this argument for the state $|\min\rangle$ with $b = -b_{\max}$ yields the same result.

We now note that, because $J_+$ increases the eigenvalue $b$, and this eigenvalue is bounded above
by $b_{\max}$, we must be able to reach $|\max\rangle$ from $|\min\rangle$ by repeatedly applying $J_+$. Say we can reach
$|\max\rangle$ from $|\min\rangle$ by $n$ applications of $J_+$. This implies that

\[ b_{\max} = -b_{\max} + n\hbar , \tag{19.43} \]

so

\[ b_{\max} = \frac{n\hbar}{2} =: j\hbar , \tag{19.44} \]

with $j \in \tfrac{1}{2}\mathbb{Z}$, $j \geq 0$. We can then read off the eigenvalues for the state $|j,m\rangle$,

\[ a = \hbar^2 j(j+1) , \qquad b = m\hbar , \tag{19.45} \]

and see that $m$ can take any of the $2j + 1$ values $-j, -j+1, \ldots, j-1, j$.



Lecture 20 (Nov. 20, 2017) 


20.1 Matrix Elements of Angular Momentum Operators 


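Assume that we have normalized $|j,m\rangle$. By definition, since these are eigenstates of $J^2$ and $J_z$,
these operators are diagonal, with

\[ \langle j',m'|J^2|j,m\rangle = j(j+1)\hbar^2\, \delta_{jj'}\delta_{mm'} , \qquad \langle j',m'|J_z|j,m\rangle = m\hbar\, \delta_{jj'}\delta_{mm'} . \tag{20.1} \]

Now we only need to compute the matrix elements of $J_x$ and $J_y$. We will make use of the
identity

\[ \langle j,m|J_- J_+|j,m\rangle = \langle j,m|J^2 - J_z^2 - \hbar J_z|j,m\rangle = \hbar^2\bigl(j(j+1) - m^2 - m\bigr) . \tag{20.2} \]

We also know that

\[ J_+|j,m\rangle = c^+_{jm}|j,m+1\rangle . \tag{20.3} \]

We still need to determine the coefficient $c^+_{jm}$. Using (20.2) and $J_- = J_+^\dagger$, we see that

\[ |c^+_{jm}|^2 = \hbar^2\bigl(j(j+1) - m(m+1)\bigr) = \hbar^2 (j - m)(j + m + 1) . \tag{20.4} \]

We will choose (by convention) $c^+_{jm}$ to be real and positive, which then gives us

\[ J_+|j,m\rangle = \hbar\sqrt{(j - m)(j + m + 1)}\,|j,m+1\rangle . \tag{20.5} \]

A similar argument gives us

\[ J_-|j,m\rangle = \hbar\sqrt{(j + m)(j - m + 1)}\,|j,m-1\rangle . \tag{20.6} \]

Packaging these results together, we have

\[ \langle j',m'|J_\pm|j,m\rangle = \hbar\sqrt{(j \mp m)(j \pm m + 1)}\;\delta_{jj'}\,\delta_{m',m\pm 1} . \tag{20.7} \]

We see that when $m = +j$, the state is annihilated by $J_+$, and when $m = -j$, the state is annihilated
by $J_-$, exactly as required.

We can now easily calculate the matrix elements of $J_x$ and $J_y$, as they are linear combinations
of $J_\pm$. This allows us to explicitly write matrix representations of $\mathbf{J}$ for fixed $j$. As an example,
consider $j = 1$. In this case,

\[ J_z = \hbar \begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & -1 \end{pmatrix} , \tag{20.8} \]

where the basis is ordered as $m = +1, 0, -1$. Using the matrix elements for $J_\pm$, we can write down

\[ J_x = \frac{J_+ + J_-}{2} = \frac{\hbar}{2}\begin{pmatrix} 0 & \sqrt{2} & 0 \\ \sqrt{2} & 0 & \sqrt{2} \\ 0 & \sqrt{2} & 0 \end{pmatrix} , \qquad J_y = \frac{J_+ - J_-}{2i} = \frac{\hbar}{2}\begin{pmatrix} 0 & -\sqrt{2}\,i & 0 \\ \sqrt{2}\,i & 0 & -\sqrt{2}\,i \\ 0 & \sqrt{2}\,i & 0 \end{pmatrix} . \tag{20.9} \]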



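For any fixed $j$, we can now also construct matrix elements of the finite rotation operator $D(R)$.
For a fixed $j$, the angular momentum operators are $(2j + 1) \times (2j + 1)$ Hermitian matrices, and so
the rotation operators $D(R)$ will be $(2j + 1) \times (2j + 1)$ unitary matrices. Consider a rotation by angle
$\theta$ about the axis $\hat{n}$. First, we construct an infinitesimal rotation by $\delta\theta$ about the axis $\hat{n}$:

\[ D(\hat{n}, \delta\theta) = 1 - \frac{i}{\hbar}(\mathbf{J}\cdot\hat{n})\,\delta\theta + O(\delta\theta^2) . \tag{20.10} \]

We can then build up a finite rotation by multiplying infinitesimal rotations $N$ times and taking
$N \to \infty$,

\[ D(\hat{n}, \theta) = \lim_{N\to\infty}\Bigl(1 - \frac{i}{\hbar}(\mathbf{J}\cdot\hat{n})\frac{\theta}{N} + O\bigl((\theta/N)^2\bigr)\Bigr)^{N} = e^{-i\theta\,\mathbf{J}\cdot\hat{n}/\hbar} . \tag{20.11} \]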


20.2 Rotation Groups 

In general, symmetry operations form a mathematical structure called a group. 

Definition 1. A set $G := \{g_i\}$ forms a group if there exists a multiplication operation $\cdot$ such that:

• for all $g_1, g_2 \in G$, $g_1 \cdot g_2 \in G$;

• multiplication is associative: for all $g_1, g_2, g_3 \in G$,
\[ (g_1 \cdot g_2) \cdot g_3 = g_1 \cdot (g_2 \cdot g_3) ; \tag{20.12} \]

• there exists $1 \in G$ such that, for all $g \in G$,
\[ 1 \cdot g = g \cdot 1 = g ; \tag{20.13} \]

• for every $g \in G$, there exists $g^{-1} \in G$ such that
\[ g \cdot g^{-1} = g^{-1} \cdot g = 1 . \tag{20.14} \]

For example, the set of $3 \times 3$ orthogonal matrices forms a group under the operation of matrix
multiplication. This group is known as the orthogonal group O(3). If we restrict our attention to
rotations with $\det R = +1$, then we get a subgroup (a subset that is itself also a group under matrix
multiplication) called the special orthogonal group SO(3).

Another group pertinent to the discussion of rotations is the group of $2 \times 2$
unitary matrices satisfying $\det U = 1$. Any matrix of this kind can be written in the form

\[ U(a, b) = \begin{pmatrix} a & b \\ -b^* & a^* \end{pmatrix} , \tag{20.15} \]

with $|a|^2 + |b|^2 = 1$. Such matrices form a group under matrix multiplication, known as the special
unitary group SU(2).

In quantum mechanics, rotations are represented by unitary operators $D(R)$. These unitary
operators must themselves form a group, because the product of two of these operators must give
another, the composition is associative, there is an identity operation, and $D(R^{-1})$ is the inverse of
$D(R)$. In particular, we can see that the $D(R)$ must satisfy the same group identities as the group
of rotations $R$.
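
The group axioms for SU(2) can be spot-checked numerically. The sketch below is a minimal illustration in plain Python, assuming the parametrization in which $U(a,b)$ has rows $(a, b)$ and $(-b^*, a^*)$ with $|a|^2 + |b|^2 = 1$; the helper names and the sample parameters are invented for the example. It tests closure under multiplication and that the Hermitian conjugate acts as the inverse.

```python
from cmath import exp
from math import cos, sin

def su2(a, b):
    """U(a, b) with rows (a, b) and (-b*, a*); the caller ensures |a|^2 + |b|^2 = 1."""
    return [[a, b], [-b.conjugate(), a.conjugate()]]

def matmul(A, B):
    """Product of two 2x2 matrices."""
    return [[sum(A[r][k] * B[k][c] for k in range(2)) for c in range(2)] for r in range(2)]

def dagger(A):
    """Hermitian conjugate of a 2x2 matrix."""
    return [[A[c][r].conjugate() for c in range(2)] for r in range(2)]

def is_su2(U, tol=1e-12):
    """Check that U has the form U(a, b) with |a|^2 + |b|^2 = 1."""
    a, b = U[0]
    return (abs(U[1][0] + b.conjugate()) < tol
            and abs(U[1][1] - a.conjugate()) < tol
            and abs(abs(a) ** 2 + abs(b) ** 2 - 1) < tol)

U1 = su2(cos(0.3) * exp(0.5j), sin(0.3) * exp(-1.1j))   # arbitrary sample elements
U2 = su2(cos(1.2) * exp(-0.2j), sin(1.2) * exp(0.7j))
product = matmul(U1, U2)
```

Here `is_su2(product)` should hold, and `matmul(U1, dagger(U1))` should be the identity, so $U^{-1} = U^\dagger$ is again of the form $U(a^*, -b)$.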




The matrix elements of $D(R)$ are

\[ D^{(j)}_{m',m}(R) = \langle j,m'|e^{-i\theta\,\mathbf{J}\cdot\hat{n}/\hbar}|j,m\rangle . \tag{20.16} \]

Note that we have chosen the same $j$ for both the bra and ket; this is because $D(R)$ commutes with
$J^2$, so a rotation cannot change the value of $J^2$. As a result, in the full ket space, the (infinite-dimensional)
matrix for $D(R)$ is block diagonal in the $(j,m)$ basis, where each block corresponds
to one value of $j$. The block corresponding to $j$ will be a $(2j + 1) \times (2j + 1)$ submatrix.

An explicit set of matrices that satisfies the same group identities as a given group is called a (linear)
representation of that group. Thus, the $D(R)$ form a representation of SO(3). In particular, this representation
is called a completely reducible representation, because it can be written in block diagonal
form. Each block on the diagonal of $D(R)$, taken as a standalone matrix $D^{(j)}$ for fixed $j$, is an
irreducible representation (sometimes referred to as an irrep), as it has no invariant subspaces.

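Consider $j = 1/2$. In this case, $m$ and $m'$ take on the values $\pm\tfrac12$. The rotation operator matrix
elements are

\[ D^{(1/2)}_{m',m}(\theta, \hat{n}) = \langle m'|e^{-i\theta\,\boldsymbol{\sigma}\cdot\hat{n}/2}|m\rangle . \tag{20.17} \]

That is, the rotation operators are

\[ D^{(1/2)}(\theta, \hat{n}) = e^{-i\theta\,\boldsymbol{\sigma}\cdot\hat{n}/2} = \cos\frac{\theta}{2} - i(\boldsymbol{\sigma}\cdot\hat{n})\sin\frac{\theta}{2} . \tag{20.18} \]

Explicitly,

\[ D^{(1/2)}(\theta, \hat{n}) = \begin{pmatrix} \cos\frac{\theta}{2} - i n_z \sin\frac{\theta}{2} & (-i n_x - n_y)\sin\frac{\theta}{2} \\ (-i n_x + n_y)\sin\frac{\theta}{2} & \cos\frac{\theta}{2} + i n_z \sin\frac{\theta}{2} \end{pmatrix} . \tag{20.19} \]

This may seem unusual, because this is an SU(2) matrix. The operators $D^{(1/2)}(\theta, \hat{n})$ form a
representation of SU(2), even though we originally set out to find a representation of SO(3).
Rotations of 3D space can be characterized either by an SO(3) matrix $R$ (a three-dimensional
irrep) or by an SU(2) matrix $U$ (a two-dimensional irrep). But SO(3) and SU(2) are not in one-to-one
correspondence. Suppose we consider a rotation by $\theta = 2\pi$ about some axis $\hat{n}$. For SO(3),

\[ R(2\pi, \hat{n}) = 1_3 , \tag{20.20} \]

while for SU(2),

\[ D^{(1/2)}(2\pi, \hat{n}) = -1_2 . \tag{20.21} \]

The result is that for any spin-$\tfrac12$ system, the phase of the wavefunction is rotated by $\pi$ under a
physical $2\pi$ rotation.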



Lecture 21 (Nov. 22, 2017) 


21.1 SO(3) versus SU(2) 


As we saw, if we study only the representations of SO(3), we will only find representations with
integer spin. However, by considering the Lie algebra $\mathfrak{so}(3) \cong \mathfrak{su}(2)$, we found representations
with half-integer spin. These should properly be considered as representations of SU(2), which is
a double cover of SO(3), but has the same Lie algebra. Linear representations of SU(2) include
both integer and half-integer spins, while linear representations of SO(3) include only integer spins.
As representations of SO(3), the half-integer spin representations are what are called projective
representations, meaning that they satisfy the group structure only up to a phase:

\[ D(R_1) D(R_2) = e^{i\phi(R_1, R_2)} D(R_1 R_2) . \tag{21.1} \]

The elements of SU(2) are in two-to-one correspondence with the elements of SO(3), which
is why we say that SU(2) is a double cover of SO(3). The two SU(2) matrices $U$ and $-U$ both
correspond to the same SO(3) matrix $R$. This can be seen directly from the expression


\[ R_{ij} = \frac{1}{2}\,\mathrm{Tr}\bigl(\sigma_j U^\dagger \sigma_i U\bigr) , \tag{21.2} \]


which expresses a 3 x 3 rotation matrix R in terms of a 2 x 2 unitary matrix U. On the homework, 
you will prove that R defined in this way is a proper rotation matrix. 
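
The two-to-one map can be explored numerically. The sketch below is a hedged illustration in plain Python: it assumes the reading $R_{ij} = \tfrac12 \mathrm{Tr}(\sigma_j U^\dagger \sigma_i U)$, which is what the vector-operator rule $U^\dagger \sigma_i U = R_{ij}\sigma_j$ gives when $D(R) = U$ (the index placement is an assumption of this example), and all function names are invented here.

```python
from math import cos, sin

PAULI = (
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
)

def matmul(A, B):
    """Product of two 2x2 matrices."""
    return [[sum(A[r][k] * B[k][c] for k in range(2)) for c in range(2)] for r in range(2)]

def dagger(A):
    """Hermitian conjugate of a 2x2 matrix."""
    return [[complex(A[c][r]).conjugate() for c in range(2)] for r in range(2)]

def rotation_from_su2(U):
    """R_ij = (1/2) Tr(sigma_j U^dagger sigma_i U), assuming U^dagger sigma_i U = R_ij sigma_j."""
    Ud = dagger(U)
    R = []
    for i in range(3):
        conjugated = matmul(matmul(Ud, PAULI[i]), U)
        row = []
        for j in range(3):
            product = matmul(PAULI[j], conjugated)
            row.append(0.5 * (product[0][0] + product[1][1]).real)
        R.append(row)
    return R

def z_rotation_su2(theta):
    """U = exp(-i theta sigma_z / 2), the spin-1/2 rotation by theta about the z-axis."""
    return [[complex(cos(theta / 2), -sin(theta / 2)), 0j],
            [0j, complex(cos(theta / 2), sin(theta / 2))]]

def negate(U):
    """Return -U."""
    return [[-entry for entry in row] for row in U]
```

With this convention, `rotation_from_su2(z_rotation_su2(theta))` should be the familiar rotation matrix about $z$ by `theta`, and applying it to `negate(U)` gives the same $R$, because $U$ enters quadratically.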


21.2 Addition of Angular Momentum 


Given two systems 1 and 2, with angular momenta $j_1$ and $j_2$, respectively, what are the possible
total angular momenta for the combined system? We first must choose a basis for the combined
Hilbert space $\mathcal{H}_1 \otimes \mathcal{H}_2$. We will choose the basis

\[ |j_1, m_1; j_2, m_2\rangle = |j_1, m_1\rangle \otimes |j_2, m_2\rangle . \tag{21.3} \]

We know how to write down operators on this tensor product Hilbert space. For example, the
total angular momentum is

\[ \mathbf{J} = \mathbf{J}_1 + \mathbf{J}_2 := \mathbf{J}_1 \otimes 1_2 + 1_1 \otimes \mathbf{J}_2 , \tag{21.4} \]

where the rightmost expression makes the meaning completely clear, and the middle expression is
a convenient shorthand that suppresses the products with the identity. Note that this combined
operator satisfies the angular momentum commutation algebra, $[J_i, J_j] = i\hbar\,\epsilon_{ijk} J_k$. (If we took the
difference of two angular momenta, rather than the sum, then the result would not satisfy the
angular momentum commutation algebra.) Note also that

\[ [J_{1,i}, J_{2,j}] = 0 , \tag{21.5} \]

because the spin operators on one subsystem act trivially on the other subsystem.
We have

\[ J^2 = J_1^2 + J_2^2 + 2\,\mathbf{J}_1 \cdot \mathbf{J}_2 , \tag{21.6} \]

from which we see that

\[ [J^2, J_{a,z}] \neq 0 , \tag{21.7} \]

where $a = 1, 2$. Thus, we see that the eigenvalue of $J^2$ is not a good quantum number in the basis
we have chosen, in the sense that the basis we have chosen is not an eigenbasis of $J^2$.

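However, because $\mathbf{J}$ satisfies the angular momentum commutation algebra, we know that $J^2$
and $J_z$ can be simultaneously diagonalized. What are the allowed eigenvalues of $(J^2, J_z)$ given $j_1$
and $j_2$? As an example, if we consider two spin-$\tfrac12$ systems, there are four states in the combined
Hilbert space, which can be written in the form

\[ |m_1 = \pm\tfrac12, m_2 = \pm\tfrac12\rangle = |{+}{+}\rangle , \; |{+}{-}\rangle , \; |{-}{+}\rangle , \; |{-}{-}\rangle . \tag{21.8} \]

Clearly,

\[ J_z|{+}{+}\rangle = \hbar\,|{+}{+}\rangle , \tag{21.9} \]

which implies that $m = +1$ for $|{+}{+}\rangle$. Similarly, we find $m = -1$ for $|{-}{-}\rangle$, and $m = 0$ for both
$|{+}{-}\rangle$ and $|{-}{+}\rangle$. These organize into a singlet $j = 0$,

\[ \frac{1}{\sqrt{2}}\bigl(|{+}{-}\rangle - |{-}{+}\rangle\bigr) , \tag{21.10} \]

and a triplet $j = 1$,

\[ |{+}{+}\rangle , \qquad \frac{1}{\sqrt{2}}\bigl(|{+}{-}\rangle + |{-}{+}\rangle\bigr) , \qquad |{-}{-}\rangle . \tag{21.11} \]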

In general, we want to know what the coefficients are for the change of basis from $|j_1, m_1; j_2, m_2\rangle$
to the new basis $|j, m; j_1, j_2\rangle$, i.e., the coefficients

\[ \langle j_1, m_1; j_2, m_2|j, m; j_1, j_2\rangle . \tag{21.12} \]

These are known as Clebsch-Gordan coefficients. We now discuss some general properties of
Clebsch-Gordan coefficients:

1. The Clebsch-Gordan coefficients are zero unless $m = m_1 + m_2$. To see this, note that

\[ J_z - J_{1,z} - J_{2,z} = 0 \tag{21.13} \]

as an operator. In particular,

\[ (J_z - J_{1,z} - J_{2,z})|j, m; j_1, j_2\rangle = 0 . \tag{21.14} \]

Then,

\[ \langle j_1, m_1; j_2, m_2|(J_z - J_{1,z} - J_{2,z})|j, m; j_1, j_2\rangle = 0 . \tag{21.15} \]

Acting with $J_z$ to the right and with $J_{a,z}$ to the left (and dividing by $\hbar$), this gives us

\[ (m - m_1 - m_2)\,\langle j_1, m_1; j_2, m_2|j, m; j_1, j_2\rangle = 0 . \tag{21.16} \]

Thus, the Clebsch-Gordan coefficient must vanish unless $m = m_1 + m_2$.


2. The Clebsch-Gordan coefficients vanish unless $|j_1 - j_2| \leq j \leq j_1 + j_2$. As a consistency check,
we can see that this condition gives the correct number of states. In the basis $|j_1, m_1; j_2, m_2\rangle$,
we can see that the total number of states is

\[ (2j_1 + 1)(2j_2 + 1) , \tag{21.17} \]

because subsystem $a$ has $2j_a + 1$ possible states. We can similarly count the number of states
in the basis $|j, m; j_1, j_2\rangle$, assuming that $|j_1 - j_2| \leq j \leq j_1 + j_2$. For each $j$, there are $2j + 1$
states, and $j$ runs between $|j_1 - j_2|$ and $j_1 + j_2$. We further assume that every $j$ in this range
occurs, but with two successive $j$ values differing by 1 and not by $\tfrac12$. This gives the total
number of states as

\[ \sum_{j = j_1 - j_2}^{j_1 + j_2} (2j + 1) = (2j_1 + 1)(2j_2 + 1) , \tag{21.18} \]

where we have assumed $j_1 \geq j_2$, without loss of generality.


3. If $j_1$ and $j_2$ are both half-integer (by which we mean an integer plus $\tfrac12$) or both integer, then
$j$ is integer. To see this, note that under a $2\pi$ rotation, both subsystems will acquire the
same phase ($-1$ if they are both half-integer, or $+1$ if they are both integer), and so the total
system acquires no phase. Thus, the combined system has integer angular momentum.

Using the same type of argument, we see that the converse is also true: if only one of $(j_1, j_2)$
is half-integer, then $j$ is half-integer.
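
The state-counting check in property 2 and the integer versus half-integer rule in property 3 are easy to confirm for specific values. The following is a small Python sketch using exact rational arithmetic; the function names are illustrative, and the list of allowed $j$ values simply encodes the range $|j_1 - j_2| \leq j \leq j_1 + j_2$ in unit steps as stated above.

```python
from fractions import Fraction

def allowed_j(j1, j2):
    """Total angular momenta |j1 - j2|, |j1 - j2| + 1, ..., j1 + j2."""
    j1, j2 = Fraction(j1), Fraction(j2)
    values, j = [], abs(j1 - j2)
    while j <= j1 + j2:
        values.append(j)
        j += 1
    return values

def count_coupled_states(j1, j2):
    """Number of states |j, m; j1, j2>, i.e. the sum of (2j + 1) over the allowed j."""
    return sum(int(2 * j + 1) for j in allowed_j(j1, j2))

def count_product_states(j1, j2):
    """Number of states |j1, m1; j2, m2>, i.e. (2 j1 + 1)(2 j2 + 1)."""
    return int((2 * Fraction(j1) + 1) * (2 * Fraction(j2) + 1))
```

For two spin-$\tfrac12$ systems, `allowed_j(Fraction(1, 2), Fraction(1, 2))` gives $j = 0, 1$, and both counting functions return 4.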


21.3. Discrete Symmetries 


We now move on to discuss parity (a.k.a. inversion). Let $\Pi$ be the operator that implements the
parity transformation. By this, we mean that

\[ |\alpha\rangle \to \Pi|\alpha\rangle := |\alpha_\Pi\rangle \tag{21.19} \]

under the parity transformation $\mathbf{x} \to -\mathbf{x}$. We require that

\[ \langle\alpha_\Pi|\mathbf{x}|\beta_\Pi\rangle = -\langle\alpha|\mathbf{x}|\beta\rangle , \tag{21.20} \]

which implies that

\[ \Pi^\dagger \mathbf{x}\, \Pi = -\mathbf{x} . \tag{21.21} \]

In other words,

\[ \{\mathbf{x}, \Pi\} = 0 , \tag{21.22} \]

where $\{\cdot,\cdot\}$ is the anticommutator.

Two successive inversions do nothing, i.e., $\Pi^2 = 1$. This implies that the eigenvalues of $\Pi$ are
$\pm 1$. States with eigenvalue $+1$ are called even under parity, and states with eigenvalue $-1$ are
called odd. Operators that are odd under parity must anticommute with $\Pi$. For example, $\mathbf{p}$ is odd,
as $\mathbf{p} \to -\mathbf{p}$ under parity, so

\[ \{\mathbf{p}, \Pi\} = 0 . \tag{21.23} \]

What about angular momentum? For a general rotation (not necessarily a proper rotation),

\[ \Pi\, R(\theta, \hat{n}) = R(\theta, \hat{n})\, \Pi . \tag{21.24} \]

That is, parity and rotation commute with one another. This implies that $[\Pi, \mathbf{J}] = 0$. In general,
$\mathbf{J} = \mathbf{L} + \mathbf{S}$. We can see that $\mathbf{L} = \mathbf{x} \times \mathbf{p}$ is even under parity, because $\mathbf{x}$ and $\mathbf{p}$ are each odd.
Because $\mathbf{J}$ and $\mathbf{L}$ are both even under parity, we conclude that $\mathbf{S}$ must also be even under parity.




Lecture 22 (Nov. 27, 2017) 


22.1 Parity 
22.1.1 Some Standard Terminology 


When we refer to a scalar, we mean an observable that is invariant under rotations and even under
parity. Examples include $\mathbf{x}^2$ and $\mathbf{p}^2$. There is a different type of object, called a pseudoscalar, that
is invariant under rotations, but odd under parity. An example of a pseudoscalar is the product
$\mathbf{S} \cdot \mathbf{x}$; this is invariant under rotations, but odd under parity because $\mathbf{S}$ is parity even and $\mathbf{x}$ is
parity odd.

A vector is an object that transforms as a vector under rotations and is odd under parity.
Examples are $\mathbf{x}$ and $\mathbf{p}$. A pseudovector is an object that transforms as a vector under rotations,
but is even under parity. Examples of pseudovectors are $\mathbf{L}$, $\mathbf{S}$, and $\mathbf{J}$.


22.1.2 Wavefunctions Under Parity 


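Eigenstates of parity satisfy

\[ \Pi|\psi\rangle = \pm|\psi\rangle , \tag{22.1} \]

as we know that $\Pi$ has eigenvalues $\pm 1$ only. If we take the matrix element with a position ket, then
we find

\[ \langle\mathbf{x}|\Pi|\psi\rangle = \pm\langle\mathbf{x}|\psi\rangle = \pm\psi(\mathbf{x}) . \tag{22.2} \]

On the other hand, we can have the parity operator act on the position ket, giving

\[ \langle\mathbf{x}|\Pi|\psi\rangle = \langle -\mathbf{x}|\psi\rangle = \psi(-\mathbf{x}) . \tag{22.3} \]

Thus, wavefunctions of parity eigenstates satisfy

\[ \psi(-\mathbf{x}) = \pm\psi(\mathbf{x}) . \tag{22.4} \]

We refer to such wavefunctions as even ($+$) or odd ($-$).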


22.1.3 Momentum and Angular Momentum 


As we have seen, $[\mathbf{p}, \Pi] \neq 0$. Thus, we cannot simultaneously diagonalize the momentum and parity
operators, i.e., momentum eigenstates are not, in general, parity eigenstates.
As an example, consider the free particle

\[ H = \frac{p^2}{2m} . \tag{22.5} \]

The energy eigenstates

\[ \psi_p(x) = \frac{1}{\sqrt{2\pi\hbar}}\, e^{ipx/\hbar} \tag{22.6} \]

are not parity eigenstates. However, $[\Pi, H] = 0$, which means that we can choose energy eigenstates
that are also parity eigenstates in this case. Because the two states

\[ |{\pm p}\rangle , \qquad \langle x|{\pm p}\rangle = \frac{1}{\sqrt{2\pi\hbar}}\, e^{\pm ipx/\hbar} , \tag{22.7} \]

are degenerate, we can choose

\[ |p_\pm\rangle = \frac{|p\rangle \pm |{-p}\rangle}{\sqrt{2}} \tag{22.8} \]

as energy eigenstates. These are energy and parity eigenstates, but are not momentum eigenstates.
By contrast, $[\mathbf{L}, \Pi] = 0$, so we can simultaneously diagonalize both orbital angular momentum
and parity. Under a parity operation, the spherical angles $(\theta, \phi)$ are sent to

\[ (\theta, \phi) \to (\pi - \theta, \phi + \pi) . \tag{22.9} \]

This tells us that states with a definite angular position transform as

\[ \Pi|\theta, \phi\rangle = |\pi - \theta, \phi + \pi\rangle . \tag{22.10} \]

We can write the orbital angular momentum eigenstates $|\ell, m\rangle$ in terms of these states using the
matrix elements

\[ \langle\theta, \phi|\ell, m\rangle := Y_{\ell m}(\theta, \phi) , \tag{22.11} \]

which are known as spherical harmonics.
The spherical harmonic $Y_{00}$ is a constant, meaning that

\[ \Pi|\ell = 0, m = 0\rangle = |\ell = 0, m = 0\rangle . \tag{22.12} \]

The $\ell = 1$ states transform together as a vector, i.e., as linear combinations of $x, y, z$. In particular,
the $\ell = 1, m = +1$ state transforms like $x + iy$; the $\ell = 1, m = -1$ state transforms like $x - iy$; and
the $\ell = 1, m = 0$ state transforms like $z$. Because vectors are odd under parity, this tells us that

\[ \Pi|\ell = 1, m\rangle = -|\ell = 1, m\rangle , \tag{22.13} \]

or equivalently,

\[ Y_{1m}(\pi - \theta, \phi + \pi) = -Y_{1m}(\theta, \phi) . \tag{22.14} \]

In general, $Y_{\ell m}$ has parity $(-1)^\ell$.
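
The parity rule $(-1)^\ell$ can be checked on the first few spherical harmonics by evaluating them at $(\theta, \phi)$ and at the inverted point $(\pi - \theta, \phi + \pi)$. The Python sketch below uses the standard angular dependence of $Y_{\ell m}$ for $\ell \leq 2$ with normalization constants dropped (they cancel in the ratio); the dictionary and function names, and the sample angles, are choices made for this example.

```python
from cmath import exp
from math import cos, sin, pi

# Angular dependence of a few Y_lm, with normalization constants dropped.
ANGULAR = {
    (0, 0): lambda t, p: 1.0,
    (1, 0): lambda t, p: cos(t),
    (1, 1): lambda t, p: sin(t) * exp(1j * p),
    (2, 0): lambda t, p: 3 * cos(t) ** 2 - 1,
    (2, 1): lambda t, p: sin(t) * cos(t) * exp(1j * p),
    (2, 2): lambda t, p: sin(t) ** 2 * exp(2j * p),
}

def parity_eigenvalue(l, m, theta=0.7, phi=0.3):
    """Ratio Y_lm(pi - theta, phi + pi) / Y_lm(theta, phi) at a generic point."""
    f = ANGULAR[(l, m)]
    return f(pi - theta, phi + pi) / f(theta, phi)
```

For each tabulated pair, `parity_eigenvalue(l, m)` should come out as $(-1)^\ell$ up to rounding, independent of $m$.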


22.1.4 Selection Rules 


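Let $O$ be an operator with definite parity, i.e.,

\[ \Pi^\dagger O\, \Pi = \lambda O , \tag{22.15} \]

with $\lambda = \pm 1$. Consider the matrix elements $\langle\psi|O|\psi'\rangle$ of this operator with two parity eigenstates
$|\psi\rangle$ and $|\psi'\rangle$, such that

\[ \Pi|\psi\rangle = s|\psi\rangle , \qquad \Pi|\psi'\rangle = s'|\psi'\rangle , \tag{22.16} \]

with $s, s' = \pm 1$. We then have

\[ \langle\psi|O|\psi'\rangle = \langle\psi|\Pi\,(\Pi O \Pi)\,\Pi|\psi'\rangle = \lambda s s'\,\langle\psi|O|\psi'\rangle , \tag{22.17} \]

where in the first step we have used $\Pi^2 = 1$, and in the second that $\Pi^\dagger = \Pi^{-1} = \Pi$. This implies that $\langle\psi|O|\psi'\rangle = 0$ unless $\lambda s s' = 1$. Thus,
if $O$ is even under parity ($\lambda = +1$), then $|\psi\rangle$ and $|\psi'\rangle$ must have the same parity for the matrix
element to be nonzero; similarly, if $O$ is odd under parity ($\lambda = -1$), then $|\psi\rangle$ and $|\psi'\rangle$ must have
opposite parity for the matrix element to be nonzero. This is a selection rule.

For example, $\langle\psi|\mathbf{x}|\psi'\rangle \neq 0$ only when $|\psi\rangle$ and $|\psi'\rangle$ have opposite parity. As a corollary, we see
that the expectation value of $\mathbf{x}$ in any parity eigenstate must be zero.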




22.2 Time Reversal 
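Classical physics is time-reversal invariant: Newton’s law

\[ m\ddot{\mathbf{x}} = -\nabla V(\mathbf{x}) \tag{22.18} \]

is invariant under $t \to -t$, $\mathbf{x} \to \mathbf{x}$. Thus, if $\mathbf{x}(t)$ is a valid solution to Newton’s equation, then so is
$\mathbf{x}(-t)$.
Consider now the Schrödinger equation,

\[ i\hbar\,\frac{\partial\psi}{\partial t} = \Bigl(-\frac{\hbar^2}{2m}\nabla^2 + V(\mathbf{x})\Bigr)\psi . \tag{22.19} \]

If we let $t \to -t$, we see that we can get a solution if we take

\[ \psi(\mathbf{x}, t) \to \psi^*(\mathbf{x}, -t) \tag{22.20} \]

for some solution $\psi(\mathbf{x}, t)$. This suggests that we should take the time reversal operator $\Theta$ to be
anti-unitary. (Recall that an anti-unitary operator $A$ can be written in the form $A = KU$, where
$K$ is complex conjugation and $U$ is some unitary operator.) Thus, we have

\[ \Theta\bigl(a|\alpha\rangle + b|\beta\rangle\bigr) = a^*\,\Theta|\alpha\rangle + b^*\,\Theta|\beta\rangle \tag{22.21} \]

for any $a, b \in \mathbb{C}$ and $|\alpha\rangle, |\beta\rangle \in \mathcal{H}$.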
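We now consider combinations of time reversal and time translation operations. Assuming that
time reversal is a symmetry, we require that

\[ |\psi(-\delta t)\rangle = \Theta|\psi(\delta t)\rangle , \qquad |\psi(0)\rangle = \Theta|\psi(0)\rangle . \tag{22.22} \]

By using forward and backward time translations from $t = 0$, we see that

\[ |\psi(-\delta t)\rangle = \Bigl(1 + \frac{iH}{\hbar}\,\delta t\Bigr)|\psi(0)\rangle , \qquad |\psi(\delta t)\rangle = \Bigl(1 - \frac{iH}{\hbar}\,\delta t\Bigr)|\psi(0)\rangle , \tag{22.23} \]

so the statements $|\psi(-\delta t)\rangle = \Theta|\psi(\delta t)\rangle$ and $|\psi(0)\rangle = \Theta|\psi(0)\rangle$ imply that

\[ iH\,\Theta|\psi(0)\rangle = \Theta(-iH)|\psi(0)\rangle . \tag{22.24} \]

Thus, we have

\[ iH\,\Theta = \Theta(-iH) \tag{22.25} \]

as an operator equation. Because $\Theta$ is anti-unitary, $\Theta(-iH) = i\,\Theta H$, so this tells us that $[H, \Theta] = 0$, exactly as expected
of a symmetry of the Hamiltonian.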

As usual, operators transform under time reversal as $O \to \Theta O \Theta^{-1}$. An operator $O$ is even/odd
under time reversal if

\[ \Theta O \Theta^{-1} = \pm O . \tag{22.26} \]

We require that

\[ \Theta\,\mathbf{x}\,\Theta^{-1} = \mathbf{x} , \qquad \Theta\,\mathbf{p}\,\Theta^{-1} = -\mathbf{p} . \tag{22.27} \]

There are two ways to see that $\mathbf{p}$ must be odd under time reversal: first, we could consider the
position space representation $\mathbf{p} \to -i\hbar\nabla$, and use the fact that $\Theta$ is anti-unitary; alternatively, we
can use the fact that time reversal should preserve the commutation algebra $[x_i, p_j] = i\hbar\,\delta_{ij}$, which
requires that $\mathbf{p}$ be odd because $\mathbf{x}$ is even and the right-hand side contains a factor of $i$.
Similarly, in order to preserve the commutation algebra $[J_i, J_j] = i\hbar\,\epsilon_{ijk} J_k$, we need

\[ \Theta\,\mathbf{J}\,\Theta^{-1} = -\mathbf{J} , \tag{22.28} \]

meaning that $\mathbf{L} \to -\mathbf{L}$ and $\mathbf{S} \to -\mathbf{S}$ under time reversal. Note that $L^2 \to L^2$, so this tells us that
$\Theta|\ell, m\rangle \propto |\ell, -m\rangle$. In particular,

\[ \Theta|\ell, m\rangle = (-1)^m\,|\ell, -m\rangle . \tag{22.29} \]

Here, the phase factor $(-1)^m$ is a convention choice built into the definition of the spherical harmonics,

\[ Y_{\ell, -m}(\theta, \phi) = (-1)^m\, Y^*_{\ell m}(\theta, \phi) . \tag{22.30} \]

22.2.1 Time Reversal and Spin 


We find interesting outcomes when acting on spin-$\tfrac12$ systems (or systems with other half-integer spin) with time reversal. The statement

$$\Theta \mathbf{J} \Theta^{-1} = -\mathbf{J} \tag{22.31}$$

implies for a spin-$\tfrac12$ particle that

$$J_z \Theta|\pm\rangle = -\Theta J_z|\pm\rangle = \mp\frac{\hbar}{2}\,\Theta|\pm\rangle . \tag{22.32}$$

Thus, we see that $\Theta|+\rangle \propto |-\rangle$. In general, there can be some phase $\eta$, so that

$$\Theta|+\rangle = \eta|-\rangle . \tag{22.33}$$

We can write this equation in the form

$$\Theta|+\rangle = \eta\, e^{-i\pi S_y/\hbar}|+\rangle . \tag{22.34}$$

We could have similarly chosen $S_x$ instead of $S_y$, or indeed any spin operator in the $x,y$-plane. Based on this statement, we write

$$\Theta = \eta\, e^{-i\pi S_y/\hbar} K , \tag{22.35}$$

where we have included $K$ because $\Theta$ is anti-unitary. We then have

$$\Theta|-\rangle = \eta\, e^{-i\pi S_y/\hbar} K|-\rangle = -\eta|+\rangle . \tag{22.36}$$

From this, we see that

$$\Theta^2|+\rangle = \Theta\big(\eta|-\rangle\big) = \eta^*\Theta|-\rangle = -|\eta|^2|+\rangle = -|+\rangle , \tag{22.37}$$

where we have used the fact that $|\eta|^2 = 1$ because $\eta$ is purely a phase. Similarly, $\Theta^2|-\rangle = -|-\rangle$. This means that

$$\Theta^2 = -1 \tag{22.38}$$

holds as an operator equation for a spin-$\tfrac12$ system. This is true for any system with half-integer spin. There is a standard phase choice for spin-$\tfrac12$, which is to take $\eta = i$, which gives

$$\Theta = i\, e^{-i\pi S_y/\hbar} K . \tag{22.39}$$
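The action of the anti-unitary operator on spinors can be checked numerically. The following is a minimal sketch, assuming the usual column-vector representation in the $S_z$ basis and representing $\Theta = UK$ as "complex-conjugate, then multiply by $U$"; the helper name `time_reverse` and the sample phase are illustrative choices, not part of the notes.

```python
import numpy as np

# Spin-1/2 time reversal, Theta = eta * exp(-i*pi*S_y/hbar) * K.
# An anti-unitary operator U K acts on a column vector psi as U @ conj(psi).
sigma_y = np.array([[0, -1j], [1j, 0]])
# exp(-i*pi*sigma_y/2) = cos(pi/2) - i*sin(pi/2)*sigma_y = -i*sigma_y
rotation = -1j * sigma_y

def time_reverse(psi, eta=1j):
    """Apply Theta = eta * exp(-i pi S_y / hbar) K to a two-component spinor."""
    return eta * rotation @ np.conj(psi)

up = np.array([1.0, 0.0], dtype=complex)     # |+>
down = np.array([0.0, 1.0], dtype=complex)   # |->

eta = np.exp(0.7j)  # arbitrary phase, chosen only for illustration
theta_up = time_reverse(up, eta)               # compare with  eta * |->
theta_down = time_reverse(down, eta)           # compare with -eta * |+>
theta_sq_up = time_reverse(theta_up, eta)      # compare with -|+>
theta_sq_down = time_reverse(theta_down, eta)  # compare with -|->
```

The sign of $\Theta^2$ does not depend on the phase $\eta$, because the second application of $K$ conjugates the first factor of $\eta$.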




Lecture 23 (Nov. 29, 2017) 


23.1 Consequences of Time Reversal Symmetry 
23.1.1 Spinless Particles 


Theorem 5. Consider a system of spinless particles. Suppose that H is time-reversal invariant, 
and that there exists a non-degenerate energy eigenket |n). The corresponding energy eigenfunction 
can be chosen to be real. 


Proof. Because $|n\rangle$ is an energy eigenket, we have

$$H|n\rangle = E_n|n\rangle . \tag{23.1}$$

Because time reversal is a symmetry, we then have

$$H\big(\Theta|n\rangle\big) = \Theta H|n\rangle = E_n\big(\Theta|n\rangle\big) , \tag{23.2}$$

which implies

$$|n\rangle = \eta\,\Theta|n\rangle \tag{23.3}$$

for some phase $\eta$, because $|n\rangle$ is non-degenerate. The corresponding wavefunction is then

$$\psi_n(x) = \langle x|n\rangle = \eta\,\langle x|\Theta|n\rangle = \eta\,\psi_n^*(x) . \tag{23.4}$$

We can then make the redefinition $\tilde\psi_n(x) = \eta^{-1/2}\psi_n(x)$ in order to get a real wavefunction. This completes the proof.


23.1.2 No Conservation Law 


There is no conservation law associated with time reversal. Even though $[\Theta, H] = 0$, there is no concept of a time reversal quantum number. Despite the fact that $\Theta$ commutes with the Hamiltonian, we find that

$$\Theta\, U(t, t_0) \neq U(t, t_0)\,\Theta , \tag{23.5}$$

where $U(t, t_0)$ is the time-evolution operator. For example, suppose that we consider an energy eigenstate,

$$H|\psi\rangle = E|\psi\rangle . \tag{23.6}$$

Then,

$$|\psi(t)\rangle = e^{-iEt/\hbar}|\psi(0)\rangle , \tag{23.7}$$

while, if $\Theta|\psi(0)\rangle = |\psi(0)\rangle$,

$$\Theta|\psi(t)\rangle = e^{iEt/\hbar}|\psi(0)\rangle \neq |\psi(t)\rangle . \tag{23.8}$$

Thus, $\Theta|\psi\rangle = |\psi\rangle$ is not preserved under time evolution.


23.1.3 Kramers’ Rule for Half-Integer Spin


Theorem 6 (Kramers’ Rule). For a system with half-integer spin, time-reversal symmetry implies a two-fold degeneracy of all energy eigenstates.


Proof. Suppose we have an energy eigenket $|n\rangle$,

$$H|n\rangle = E_n|n\rangle . \tag{23.9}$$

If time reversal is a symmetry, then $\Theta|n\rangle$ has the same energy $E_n$. Suppose that $\Theta|n\rangle$ were the same state as $|n\rangle$, up to a phase:

$$\Theta|n\rangle = e^{i\delta}|n\rangle . \tag{23.10}$$

Then,

$$\begin{aligned}
\Theta^2|n\rangle &= \Theta\big(e^{i\delta}|n\rangle\big) \\
&= e^{-i\delta}\,\Theta|n\rangle \\
&= e^{-i\delta}e^{i\delta}|n\rangle \\
&= |n\rangle .
\end{aligned} \tag{23.11}$$

However, for half-integer spin, we have seen that $\Theta^2 = -1$, so this is a contradiction. Thus, $\Theta|n\rangle$ must be a distinct state from $|n\rangle$, and we get a two-fold degeneracy for all energy eigenstates. There may be additional degeneracy, but all states will come in pairs in this way.
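The pairing of levels can be illustrated numerically. The sketch below assumes three spin-$\tfrac12$ particles (so the total spin is half-integer), takes $\Theta = UK$ with $U = (-i\sigma_y)^{\otimes 3}$, and builds a time-reversal-invariant Hamiltonian by symmetrizing a random Hermitian matrix; the random matrix and its seed are purely illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)
n_spins = 3  # odd number of spin-1/2 particles -> half-integer total spin
dim = 2 ** n_spins

# Unitary part of Theta = U K, with U = (-i sigma_y) acting on each spin.
u_single = np.array([[0.0, -1.0], [1.0, 0.0]])
U = u_single
for _ in range(n_spins - 1):
    U = np.kron(U, u_single)

# Random Hermitian matrix A, then symmetrize under time reversal using
# Theta A Theta^{-1} = U conj(A) U^dagger.
A = rng.normal(size=(dim, dim)) + 1j * rng.normal(size=(dim, dim))
A = (A + A.conj().T) / 2
H = (A + U @ A.conj() @ U.conj().T) / 2

theta_squared = U @ U.conj()       # Theta^2 = U U^*, a multiple of the identity
energies = np.linalg.eigvalsh(H)   # sorted in ascending order
pair_gaps = energies[1::2] - energies[0::2]  # splitting within each Kramers pair
```

Repeating the construction with an even number of spins gives $\Theta^2 = +1$, and the pairing is then no longer enforced.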


23.2 Uses of Symmetry in Solving the Schrödinger Equation
23.2.1 Symmetric Double-Well Potential


Consider some symmetric double-well potential $V(x)$ and Hamiltonian

$$H = \frac{p^2}{2m} + V(x) . \tag{23.12}$$

By definition, we have $V(x) = V(-x)$, because the potential is symmetric.

This Hamiltonian is invariant under both $\Pi$ and $\Theta$. The eigenfunctions can thus be chosen to be real (from time-reversal invariance) and either even or odd under $x \to -x$ (from parity invariance). We can argue that the ground state $\psi_0(x)$ must be symmetric under $x \to -x$ (specifically, it has no nodes); furthermore, it is physically apparent that the ground state should be peaked near the bottom of each well. To see the first point, imagine that the ground state were antisymmetric; in this case, we must have $\psi_0(0) = 0$. The squared slope of the wavefunction gives the contribution to the kinetic energy. We can then cut the ground state apart at $x = 0$, negate one side, and then glue the halves back together to give a symmetric function (with a small amount of smoothing at the origin to make the new wavefunction continuously differentiable). The resulting wavefunction has the same potential energy as the supposed ground state wavefunction, but has a smaller slope at the origin, meaning it has lower energy than the ground state, which is a contradiction. Using this approach repeatedly, we can argue that the ground state has no nodes.

What can we say about the first excited state $\psi_1(x)$? Note that if we try to construct another wavefunction with zero nodes, it will not be orthogonal to the ground state. The next-lowest energy state will have the fewest nodes possible, from the argument above, and so we expect the first excited state to be antisymmetric with a single node.
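These statements about parity and node counting can be checked on a concrete example. The sketch below assumes units with $\hbar = m = 1$ and the illustrative potential $V(x) = (x^2 - 1)^2$ (not a potential taken from the notes), discretizes the Hamiltonian with second-order finite differences, and inspects the two lowest eigenvectors.

```python
import numpy as np

# Units with hbar = m = 1; V(x) = (x^2 - 1)^2 is an illustrative choice.
x = np.linspace(-5.0, 5.0, 801)
h = x[1] - x[0]
V = (x**2 - 1.0) ** 2

# Finite-difference Hamiltonian for -psi''/2 + V psi, with psi = 0 outside the box.
diag = 1.0 / h**2 + V
off = np.full(x.size - 1, -0.5 / h**2)
H = np.diag(diag) + np.diag(off, 1) + np.diag(off, -1)

energies, states = np.linalg.eigh(H)
psi0, psi1 = states[:, 0], states[:, 1]

def count_nodes(psi, rel_tol=1e-6):
    """Count sign changes, ignoring points where |psi| is numerically zero."""
    significant = psi[np.abs(psi) > rel_tol * np.abs(psi).max()]
    return int(np.sum(np.sign(significant[:-1]) != np.sign(significant[1:])))

nodes = (count_nodes(psi0), count_nodes(psi1))
is_even_0 = np.allclose(psi0, psi0[::-1], atol=1e-6)   # psi0(x) = psi0(-x)
is_odd_1 = np.allclose(psi1, -psi1[::-1], atol=1e-6)   # psi1(x) = -psi1(-x)
```

The eigenvectors returned by `eigh` are real, in line with Theorem 5, and their overall sign is arbitrary; the parity and node counts do not depend on it.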

Classically, a particle moving in such a potential will oscillate around the bottom of one of the 
two wells; it must choose a particular well to reside in. This is a classical example of what is called 
a spontaneously broken symmetry. 




23.2.2 3D Particle in a Spherically Symmetric Potential


The spin decouples from the orbital angular momentum, and so we will ignore spin for now. We see that $[\mathbf{L}, H] = 0$, because $[\mathbf{J}, H] = 0$ and $[\mathbf{S}, H] = 0$. We can then choose the energy eigenstates to be eigenstates of $\mathbf{L}^2$ and $L_z$. We then label the energy eigenstates by $|n, \ell, m\rangle$, where $\ell$ and $m$ specify the eigenvalues of $\mathbf{L}^2$ and $L_z$, respectively. The wavefunction can then be written in the form

$$\psi_{n\ell m}(\mathbf{r}) = \langle \mathbf{r}|n, \ell, m\rangle = R_{n\ell}(r)\,Y_{\ell m}(\theta, \phi) . \tag{23.14}$$

That is, the radial and angular parts of the wavefunction factorize. We will see the details of this in a moment.

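In spherical polar coordinates, we can write

$$\nabla^2 = \frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial}{\partial r}\right) - \frac{\mathbf{L}^2}{\hbar^2 r^2} = \frac{\partial^2}{\partial r^2} + \frac{2}{r}\frac{\partial}{\partial r} - \frac{\mathbf{L}^2}{\hbar^2 r^2} . \tag{23.15}$$

Thus, the Hamiltonian acts as

$$\begin{aligned}
H\,R_{n\ell}(r)Y_{\ell m}(\theta,\phi) &= \left[-\frac{\hbar^2}{2m}\frac{1}{r^2}\frac{d}{dr}\left(r^2\frac{dR_{n\ell}}{dr}\right) + \frac{\hbar^2\ell(\ell+1)}{2mr^2}R_{n\ell} + V(r)R_{n\ell}\right]Y_{\ell m}(\theta,\phi) \\
&= E_{n\ell}R_{n\ell}Y_{\ell m} .
\end{aligned} \tag{23.16}$$

We can cancel $Y_{\ell m}$ from both sides of this equation, and it becomes an ordinary differential equation for $R_{n\ell}(r)$.

From the requirement that $\langle n,\ell,m|n,\ell,m\rangle = 1$, we have

$$\int_0^\infty dr\, r^2 R_{n\ell}^*(r)R_{n\ell}(r)\,\underbrace{\int d\Omega\, Y_{\ell m}^*(\theta,\phi)Y_{\ell m}(\theta,\phi)}_{1} = 1 , \tag{23.17}$$

giving

$$\int_0^\infty dr\, r^2 |R_{n\ell}(r)|^2 = 1 . \tag{23.18}$$

We can understand this ODE more easily by making the change of variables

$$R_{n\ell}(r) = \frac{u_{n\ell}(r)}{r} . \tag{23.19}$$

The radial equation then becomes

$$-\frac{\hbar^2}{2m}\frac{d^2u_{n\ell}}{dr^2} + V_{\text{eff}}(r)\,u_{n\ell} = E_{n\ell}\,u_{n\ell} , \tag{23.20}$$

where

$$V_{\text{eff}}(r) = V(r) + \frac{\hbar^2\ell(\ell+1)}{2mr^2} \tag{23.21}$$

and

$$\int_0^\infty dr\, |u_{n\ell}(r)|^2 = 1 . \tag{23.22}$$

For $\ell > 0$, this effective potential has a local minimum (for sufficiently non-singular attractive potentials $V(r)$), which serves to localize the energy eigenfunctions.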
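Because the radial equation for $u_{n\ell}$ has the form of a one-dimensional Schrödinger equation on $r > 0$, it can be solved with the same finite-difference approach used for 1D problems. The sketch below assumes the Coulomb potential $V(r) = -1/r$ in atomic units ($\hbar = m = e = 1$) as a test case, since the exact levels $-1/(2n^2)$ are known; the grid parameters are illustrative.

```python
import numpy as np

def radial_levels(ell, n_levels=2, r_max=50.0, n_points=1500):
    """Lowest eigenvalues of -u''/2 + V_eff(r) u = E u with u(0) = u(r_max) = 0.

    Atomic units, Coulomb potential V(r) = -1/r (an assumed test case).
    """
    h = r_max / (n_points + 1)
    r = h * np.arange(1, n_points + 1)              # interior grid points only
    v_eff = -1.0 / r + ell * (ell + 1) / (2.0 * r**2)
    off = np.full(n_points - 1, -0.5 / h**2)
    H = np.diag(1.0 / h**2 + v_eff) + np.diag(off, 1) + np.diag(off, -1)
    return np.linalg.eigvalsh(H)[:n_levels]

s_levels = radial_levels(ell=0)   # compare with -1/2 and -1/8
p_levels = radial_levels(ell=1)   # compare with -1/8 and -1/18
```

The boundary condition $u(0) = 0$ is what distinguishes this from an ordinary 1D problem: it follows from $R = u/r$ being finite at the origin.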




23.3 Approximation Methods
23.3.1 Time-Independent Perturbation Theory

Suppose that we have a Hamiltonian of the form

$$H = H_0 + V , \tag{23.23}$$

where $H_0$ is a Hamiltonian with known spectrum and $V$ is small. We can try to find the energy eigenstates and corresponding energy eigenvalues of the full Hamiltonian by treating $V$ as a perturbation, and writing these quantities as a systematic power series in $V$.

As a formal bookkeeping device, let’s introduce a parameter $\lambda$, and write

$$H = H_0 + \lambda V . \tag{23.24}$$

We will then expand in powers of $\lambda$ (ultimately, we are interested in $\lambda = 1$, so we will set it to be so at the end). Choose an orthonormal energy eigenbasis of $H_0$,

$$H_0|n_0\rangle = E_{n,0}|n_0\rangle , \qquad \langle m_0|n_0\rangle = \delta_{m_0,n_0} . \tag{23.25}$$

We then assume we can expand the energy eigenstates of $H$ in the form

$$|n\rangle = |n_0\rangle + \lambda|n_1\rangle + \lambda^2|n_2\rangle + \cdots , \tag{23.26}$$

and the corresponding energy eigenvalues in the form

$$E_n = E_{n,0} + \lambda E_{n,1} + \lambda^2 E_{n,2} + \cdots . \tag{23.27}$$

If we plug these expansions into the Schrödinger equation, $H|n\rangle = E_n|n\rangle$, we will get an infinite number of equations from matching the two sides order by order in $\lambda$. Explicitly, we have

$$(H_0 + \lambda V)\big(|n_0\rangle + \lambda|n_1\rangle + \cdots\big) = (E_{n,0} + \lambda E_{n,1} + \cdots)\big(|n_0\rangle + \lambda|n_1\rangle + \cdots\big) . \tag{23.28}$$




Lecture 24 (Dec. 4, 2017) 


24.1 Non-degenerate Time-Independent Perturbation Theory

We now consider the problem of approximating the spectrum for a Hamiltonian of the form

$$H = H_0 + V , \tag{24.1}$$

where $H_0$ is a Hamiltonian with a known spectrum, and $V$ is a “small” perturbation. This is the problem we began discussing in the last lecture. As a bookkeeping device, we introduced a parameter $\lambda$, and wrote

$$H = H_0 + \lambda V . \tag{24.2}$$

This parameter will allow us to organize our power counting analysis, and we can set $\lambda = 1$ at the end.

We have

$$H_0|n_0\rangle = E_{n,0}|n_0\rangle , \qquad \langle m_0|n_0\rangle = \delta_{m_0,n_0} , \tag{24.3}$$

and we wish to solve

$$(H_0 + \lambda V)|n\rangle = E_n|n\rangle . \tag{24.4}$$

We expand the energy eigenstates of $H$ in the form

$$|n\rangle = |n_0\rangle + \lambda|n_1\rangle + \lambda^2|n_2\rangle + \cdots , \tag{24.5}$$

and the corresponding energy eigenvalues in the form

$$E_n = E_{n,0} + \lambda E_{n,1} + \lambda^2 E_{n,2} + \cdots . \tag{24.6}$$

This gives us

$$(H_0 + \lambda V)\big(|n_0\rangle + \lambda|n_1\rangle + \cdots\big) = (E_{n,0} + \lambda E_{n,1} + \cdots)\big(|n_0\rangle + \lambda|n_1\rangle + \cdots\big) . \tag{24.7}$$
24.1.1 The First-Order Energy Shift

We can compare the two sides of Eq. (24.7) order-by-order in $\lambda$. At $O(\lambda^0)$, we get

$$H_0|n_0\rangle = E_{n,0}|n_0\rangle , \tag{24.8}$$

which we already knew. At $O(\lambda^1)$, we find

$$V|n_0\rangle + H_0|n_1\rangle = E_{n,0}|n_1\rangle + E_{n,1}|n_0\rangle . \tag{24.9}$$

If we take the inner product of this equation with $|n_0\rangle$, then we find

$$\langle n_0|V|n_0\rangle + \langle n_0|H_0|n_1\rangle = E_{n,0}\langle n_0|n_1\rangle + E_{n,1} . \tag{24.10}$$

The second term on the left equals the first on the right, because $\langle n_0|H_0 = E_{n,0}\langle n_0|$, and so we are left with

$$E_{n,1} = \langle n_0|V|n_0\rangle . \tag{24.11}$$

This result makes sense, even classically: if we perturb the Hamiltonian, the leading-order shift in the energy of an eigenstate is just the expectation of the perturbation in the unperturbed eigenstate.




24.1.2 The First-Order Correction to the Eigenstate 


Next we want to find $|n_1\rangle$, the leading-order correction to the energy eigenstate. Rearranging the $O(\lambda^1)$ terms of Eq. (24.7), we have

$$(H_0 - E_{n,0})|n_1\rangle = (E_{n,1} - V)|n_0\rangle . \tag{24.12}$$

If we insert

$$1 = \sum_{m_0}|m_0\rangle\langle m_0| \tag{24.13}$$

on both sides of this equation, the left-hand side becomes

$$(H_0 - E_{n,0})|n_1\rangle = \sum_{m_0}(H_0 - E_{n,0})|m_0\rangle\langle m_0|n_1\rangle = \sum_{m_0}(E_{m,0} - E_{n,0})|m_0\rangle\langle m_0|n_1\rangle , \tag{24.14}$$

while the right-hand side becomes

$$(E_{n,1} - V)|n_0\rangle = \sum_{m_0}|m_0\rangle\langle m_0|(E_{n,1} - V)|n_0\rangle = \sum_{m_0}|m_0\rangle\big(E_{n,1}\langle m_0|n_0\rangle - \langle m_0|V|n_0\rangle\big) . \tag{24.15}$$

We now assert that we can choose $\langle n_0|n_1\rangle = 0$; you can check that this is possible by considering the normalization of $|n\rangle$. The left-hand side then becomes

$$(H_0 - E_{n,0})|n_1\rangle = \sum_{m_0 \neq n_0}(E_{m,0} - E_{n,0})|m_0\rangle\langle m_0|n_1\rangle , \tag{24.16}$$

while the right-hand side becomes

$$(E_{n,1} - V)|n_0\rangle = \sum_{m_0}|m_0\rangle\big(\langle n_0|V|n_0\rangle\delta_{n_0 m_0} - \langle m_0|V|n_0\rangle\big) = -\sum_{m_0 \neq n_0}|m_0\rangle\langle m_0|V|n_0\rangle . \tag{24.17}$$

Comparing these expressions, we find

$$|n_1\rangle = \sum_{m_0 \neq n_0}\frac{|m_0\rangle\langle m_0|V|n_0\rangle}{E_{n,0} - E_{m,0}} . \tag{24.18}$$

Note that this expression only makes sense when the energy eigenvalues of $H_0$ are not degenerate.

We can now deduce a condition for the validity of perturbation theory: we demand that $\langle n_1|n_1\rangle \ll 1$ so that the first correction to the energy eigenstates is small, i.e., that

$$|\langle m_0|V|n_0\rangle| \ll |E_{n,0} - E_{m,0}| \tag{24.19}$$

and that their ratio decreases rapidly enough with increasing $m_0$.


24.1.3 The Second-Order Energy Shift


Now, we are prepared to compute $E_{n,2}$, the second-order correction to the energy. At $O(\lambda^2)$ in Eq. (24.7), we have

$$H_0|n_2\rangle + V|n_1\rangle = E_{n,0}|n_2\rangle + E_{n,1}|n_1\rangle + E_{n,2}|n_0\rangle . \tag{24.20}$$

If we take the inner product with $|n_0\rangle$, we can use

$$\langle n_0|H_0|n_2\rangle = E_{n,0}\langle n_0|n_2\rangle \tag{24.21}$$

to eliminate the first term from each side, leaving us with

$$\langle n_0|V|n_1\rangle = E_{n,1}\underbrace{\langle n_0|n_1\rangle}_{0} + E_{n,2}\langle n_0|n_0\rangle . \tag{24.22}$$

Because $|n_0\rangle$ is normalized, this gives us

$$E_{n,2} = \langle n_0|V|n_1\rangle = \sum_{m_0 \neq n_0}\frac{|\langle n_0|V|m_0\rangle|^2}{E_{n,0} - E_{m,0}} . \tag{24.23}$$

Again, this only makes sense if the energy eigenstates of $H_0$ are non-degenerate. To second order, we then have

$$E_n = E_{n,0} + \langle n_0|V|n_0\rangle + \sum_{m_0 \neq n_0}\frac{|\langle n_0|V|m_0\rangle|^2}{E_{n,0} - E_{m,0}} + O(V^3) . \tag{24.24}$$

24.2 Examples of Time-Independent Perturbation Theory
24.2.1 Spin in a Magnetic Field

Consider a spin-$\tfrac12$ in a magnetic field of the form

$$\mathbf{B} = B_x\hat{x} + B_0\hat{z} , \tag{24.25}$$

with $B_x \ll B_0$ (assume that both $B_x$ and $B_0$ are positive). The Hamiltonian is

$$H = -\gamma\,\mathbf{B}\cdot\mathbf{S} = -\gamma(B_0S^z + B_xS^x) = -\frac{\gamma\hbar}{2}(B_0\sigma^z + B_x\sigma^x) . \tag{24.26}$$

To analyze this system using perturbation theory, we define

$$H_0 = -\frac{\gamma\hbar}{2}B_0\sigma^z , \qquad V = -\frac{\gamma\hbar}{2}B_x\sigma^x . \tag{24.27}$$

To zeroth order, the energy eigenstates are $|+\rangle$ with energy $-\gamma\hbar B_0/2$ and $|-\rangle$ with energy $+\gamma\hbar B_0/2$. To second order, the shift in the energy of the state $|+\rangle$ is

$$\Delta E_+ = \langle +|V|+\rangle + \frac{|\langle -|V|+\rangle|^2}{E_{+,0} - E_{-,0}} = \frac{(\gamma\hbar B_x/2)^2}{-\gamma\hbar B_0} = -\frac{\gamma\hbar}{2}\,\frac{B_x^2}{2B_0} . \tag{24.28}$$

Similarly, we find

$$\Delta E_- = +\frac{\gamma\hbar}{2}\,\frac{B_x^2}{2B_0} . \tag{24.29}$$

Thus, the higher energy eigenvalue increases and the lower energy eigenvalue decreases. This is a general property of non-degenerate, time-independent perturbation theory: the second-order perturbation will always lower the energy of the ground state, because each numerator is non-negative (it is a squared modulus) and each denominator is negative (as the ground state has lower energy than every other state).

We know the exact energy eigenvalues for this system are

$$\pm\frac{\gamma\hbar}{2}\sqrt{B_0^2 + B_x^2} = \pm\frac{\gamma\hbar}{2}B_0\sqrt{1 + \frac{B_x^2}{B_0^2}} = \pm\frac{\gamma\hbar}{2}\left(B_0 + \frac{B_x^2}{2B_0}\right) + O(B_x^3) , \tag{24.30}$$

which matches our calculation from perturbation theory to this order.
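A short numerical comparison makes the agreement concrete. The sketch below assumes units with $\gamma = \hbar = 1$ and the illustrative field values $B_0 = 1$, $B_x = 0.1$.

```python
import numpy as np

gamma, hbar = 1.0, 1.0   # units chosen for illustration
B0, Bx = 1.0, 0.1        # B_x << B_0

sigma_z = np.array([[1.0, 0.0], [0.0, -1.0]])
sigma_x = np.array([[0.0, 1.0], [1.0, 0.0]])
H = -0.5 * gamma * hbar * (B0 * sigma_z + Bx * sigma_x)

exact = np.linalg.eigvalsh(H)                    # ascending: [lower level, upper level]
shift = 0.5 * gamma * hbar * Bx**2 / (2.0 * B0)  # magnitude of the second-order shift
perturbative = np.array([-0.5 * gamma * hbar * B0 - shift,
                         +0.5 * gamma * hbar * B0 + shift])
residual = np.abs(exact - perturbative)          # remainder beyond second order
```

For these values the residual is far smaller than the second-order shift itself, as expected when $B_x/B_0$ is small.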




24.2.2 The Quadratic Stark effect 


Consider an atom in a weak uniform external electric field. What is the effect on the ground 
state energy? We will not worry about the issue of ionization, as the likelihood of ionization from 
applying a small electric field is very small. Thus, we ignore the possibility that the electron can 
escape from the atom. 

We take the unperturbed Hamiltonian to be

$$H_0 = \frac{p^2}{2m} + V_0(r) , \tag{24.31}$$

with $V_0$ rotationally symmetric, e.g.

$$V_0 = -\frac{e^2}{r} \tag{24.32}$$

for the hydrogen atom. We then take the perturbing Hamiltonian to be

$$V = -eEz , \tag{24.33}$$

where we have simply defined the $z$-axis to be the one pointing in the direction of the electric field. The energy shift of the ground state is then

$$\Delta E_0 = \langle 0|V|0\rangle + \sum_{n \neq 0}\frac{|\langle n|V|0\rangle|^2}{E_0 - E_n} + \cdots . \tag{24.34}$$

The first term vanishes by symmetry arguments, using either parity or rotational symmetry. Thus, the shift starts at second order. We then have

$$\Delta E_0 = -e^2E^2\sum_{n \neq 0}\frac{|\langle n|z|0\rangle|^2}{E_n - E_0} + \cdots . \tag{24.35}$$

The second-order term is negative. We define $\alpha$, called the atomic polarizability, such that

$$\Delta E_0 = -\frac{1}{2}\alpha E^2 + \cdots . \tag{24.36}$$

We could have written this form down simply by symmetry arguments, but now we have a way to calculate its value:

$$\alpha = 2e^2\sum_{n \neq 0}\frac{|\langle n|z|0\rangle|^2}{E_n - E_0} . \tag{24.37}$$

In order to proceed further, we need to know the particular form of $V_0(r)$ for the system we are trying to solve. For the hydrogen atom, it turns out that this calculation can be done exactly. Dimensional analysis gives us most of the answer: we have

$$\alpha \sim e^2\,\frac{\text{length}^2}{\text{energy}} . \tag{24.38}$$

The only length scale in the hydrogen atom is the Bohr radius $a_0$, and the only energy scale is $e^2/a_0$, which is of order a Rydberg. Thus, we have

$$\alpha \sim e^2\,\frac{a_0^2}{e^2/a_0} = a_0^3 . \tag{24.39}$$

For hydrogen, the exact answer turns out to be

$$\alpha = 4.5\,a_0^3 . \tag{24.40}$$


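24.2.3 Van der Waals Interaction


An important interaction in chemistry and biology is the van der Waals force. Given two (hydrogen) atoms in their respective ground states, separated by a distance $r \gg a_0$ (with $a_0$ the Bohr radius), how does the total ground state energy depend on $r$?

Take the direction of separation to be the $z$-direction. We will take the two protons to be fixed in position. The full Hamiltonian is

$$\begin{aligned}
H &= H_0 + H_1 , \\
H_0 &= -\frac{\hbar^2}{2m}\left(\nabla_1^2 + \nabla_2^2\right) - \frac{e^2}{r_1} - \frac{e^2}{r_2} , \\
H_1 &= \frac{e^2}{r} + \frac{e^2}{|\mathbf{r} + \mathbf{r}_2 - \mathbf{r}_1|} - \frac{e^2}{|\mathbf{r} + \mathbf{r}_2|} - \frac{e^2}{|\mathbf{r} - \mathbf{r}_1|} ,
\end{aligned} \tag{24.41}$$

where $\mathbf{r}$ is the vector separating the protons, and $\mathbf{r}_i$ are the vectors separating the electrons from their respective protons.

If $r \gg a_0$, then we can expand $H_1$ in powers of $r_i/r$. This is a multipole expansion. The leading term in this case is a dipole-dipole interaction,

$$\begin{aligned}
H_1 &= \frac{e^2}{r^3}\left(x_1x_2 + y_1y_2 - 2z_1z_2\right) + O\!\left(\frac{1}{r^4}\right) \\
&= \frac{e^2}{r^3}\big(\mathbf{r}_1\cdot\mathbf{r}_2 - 3(\mathbf{r}_1\cdot\hat{r})(\mathbf{r}_2\cdot\hat{r})\big) + O\!\left(\frac{1}{r^4}\right) .
\end{aligned} \tag{24.42}$$

We then perturb in $H_1$. The zeroth order ground state is

$$\psi_{\text{gd}}(\mathbf{r}_1, \mathbf{r}_2) = \psi_{\text{gd}}(\mathbf{r}_1)\,\psi_{\text{gd}}(\mathbf{r}_2) . \tag{24.43}$$

We then have

$$\begin{aligned}
\Delta E^{(1)} &= \langle \text{gd}|H_1(\mathbf{r}_1, \mathbf{r}_2)|\text{gd}\rangle \\
&= \frac{e^2}{r^3}\langle \text{gd}|\,x_1x_2 + y_1y_2 - 2z_1z_2\,|\text{gd}\rangle .
\end{aligned} \tag{24.44}$$

Note that

$$\langle \text{gd}|x_1x_2|\text{gd}\rangle = \langle (\text{gd})_1|x_1|(\text{gd})_1\rangle\langle (\text{gd})_2|x_2|(\text{gd})_2\rangle = 0 , \tag{24.45}$$

and similarly for all terms in $\Delta E^{(1)}$. Thus, the leading-order energy shift comes in at second order. We find

$$\Delta E^{(2)} = \frac{e^4}{r^6}\sum_{(k_1,k_2) \neq (0,0)}\frac{|\langle k_1, k_2|\,x_1x_2 + y_1y_2 - 2z_1z_2\,|0, 0\rangle|^2}{E_{0,0} - E_{k_1,k_2}} . \tag{24.46}$$

Dimensional analysis tells us that

$$\Delta E^{(2)} \sim -\frac{e^4}{r^6}\,\frac{a_0^4}{\text{Ry}} \sim -\frac{e^2a_0^5}{r^6} , \tag{24.47}$$

where $\text{Ry} \sim e^2/a_0$ is of order the Rydberg energy. The punchline is that there is a weak attractive interaction, which is the van der Waals interaction. This is a purely quantum mechanical effect, arising because the dipole-dipole interaction induces virtual transitions to the excited states in each atom.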




Lecture 25 (Dec. 6, 2017) 


25.1 Degenerate Perturbation Theory 


Previously, when discussing perturbation theory, we restricted ourselves to the case where the unperturbed energy levels were not degenerate. We will now consider the case where the unperturbed Hamiltonian has degeneracies. Suppose the eigenstate $|n_0\rangle$ is degenerate with $|k_0\rangle$, i.e., $E_{n,0} = E_{k,0}$. We clearly cannot use the formula from last class for the first-order correction to the eigenstate,

$$|n\rangle = |n_0\rangle + \lambda\sum_{m \neq n}\frac{|m_0\rangle\langle m_0|V|n_0\rangle}{E_{n,0} - E_{m,0}} , \tag{25.1}$$

because the denominator vanishes at $m = k$.

The solution is to restrict our attention to the degenerate subspace and then diagonalize the perturbation on this subspace. Within the subspace of degenerate states of the unperturbed Hamiltonian of energy $E_{n,0}$ (referred to as the degenerate manifold of $E_{n,0}$), we diagonalize $V$. Within this subspace, $H_0 \propto 1$ by definition (since every state in this space has the same energy eigenvalue). Thus, $PH_0P \propto P$, where $P$ is the projector onto the degenerate subspace, and so $PH_0P$ commutes with $PVP$. We can then diagonalize the projected $V$ simultaneously with the projected $H_0$. In the orthonormal basis that diagonalizes $PVP$, the off-diagonal matrix elements of $PVP$ are zero, so we can return to using the formulas of ordinary perturbation theory. The terms in which $E_{n,0} - E_{k,0}$ vanishes no longer appear, because the corresponding numerators also vanish.


25.1.1 Linear Stark Effect 


As an example, consider an atom in an electric field. In the case of a hydrogen atom in its ground state, we found that the energy of the electron in the atom changed quadratically in the electric field (for small electric field). This was the phenomenon known as the quadratic Stark effect. Now we will discuss a situation in which the energy changes linearly in the applied electric field.

Let us choose $\mathbf{E} = E\hat{z}$. Now, we consider the $n = 2$ levels of a hydrogenic atom. There are four degenerate states (ignoring spin) of the form $|n, \ell, m\rangle$, which are

$$\underbrace{|2,1,1\rangle ,\ |2,1,0\rangle ,\ |2,1,-1\rangle}_{2p} ,\quad \underbrace{|2,0,0\rangle}_{2s} . \tag{25.2}$$

Our perturbation is $V = -eEz$. Consider $V$ in the subspace of the four $n = 2$ levels. We need to compute

$$\langle 2, \ell', m'|z|2, \ell, m\rangle . \tag{25.3}$$

Because $[z, J_z] = 0$, we have

$$0 = \langle 2, \ell', m'|(zJ_z - J_zz)|2, \ell, m\rangle = \hbar(m - m')\langle 2, \ell', m'|z|2, \ell, m\rangle . \tag{25.4}$$

Thus,

$$\langle 2, \ell', m'|z|2, \ell, m\rangle = 0 , \quad \text{if } m' \neq m . \tag{25.5}$$

Note also that under parity, $z \to -z$, while $|n, \ell, m\rangle$ has parity $(-1)^\ell$, so

$$\langle n, \ell, m'|z|n, \ell, m\rangle = 0 , \tag{25.6}$$

i.e., we will only find a non-vanishing matrix element for $\ell' \neq \ell$.

Thus, the only non-vanishing matrix elements are

$$\langle 2,1,0|z|2,0,0\rangle = \langle 2,0,0|z|2,1,0\rangle^* . \tag{25.7}$$

We can compute these entries exactly, because we know the hydrogen atom wavefunctions. Explicitly, in the basis ordered as $|2,0,0\rangle, |2,1,0\rangle, |2,1,1\rangle, |2,1,-1\rangle$, we find

$$V = \begin{pmatrix} 0 & 3ea_0E & 0 & 0 \\ 3ea_0E & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix} , \tag{25.8}$$

where $a_0$ is the Bohr radius. This matrix has eigenvalues $0, 0, \pm 3ea_0E$. Specifically, the states $|2,1,1\rangle$ and $|2,1,-1\rangle$ have no energy shift, while the state $(|2,0,0\rangle + |2,1,0\rangle)/\sqrt{2}$ has shift $+3ea_0E$ and the state $(|2,0,0\rangle - |2,1,0\rangle)/\sqrt{2}$ has shift $-3ea_0E$. This energy shift is linear in the strength of the electric field, hence the effect is called the linear Stark effect.
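The diagonalization within the degenerate manifold is a small matrix problem. As a minimal sketch, assuming the basis ordering $|2,0,0\rangle, |2,1,0\rangle, |2,1,1\rangle, |2,1,-1\rangle$ and measuring energies in units of $ea_0E$:

```python
import numpy as np

# Basis order: |2,0,0>, |2,1,0>, |2,1,1>, |2,1,-1>; energies in units of e*a0*E.
V = np.zeros((4, 4))
V[0, 1] = V[1, 0] = 3.0

shifts, vectors = np.linalg.eigh(V)   # eigenvalues in ascending order
lowest = vectors[:, 0]                # state with shift -3
highest = vectors[:, -1]              # state with shift +3
```

Only the relative sign between the $|2,0,0\rangle$ and $|2,1,0\rangle$ components of each eigenvector is meaningful, since `eigh` fixes eigenvectors only up to an overall sign.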

In general, it may be the case that diagonalizing $V$ in the space of degenerate states does not split the degeneracy. In such a case, we must go to higher order in perturbation theory, until we are able to successfully split the degeneracy, after which point we can proceed by non-degenerate perturbation theory. If $PVP$ does not split the degeneracy, then at the next order we will be considering operators of the form

$$PVP + \sum_{\substack{m \\ E_m \neq E_0}} P\,\frac{V|m\rangle\langle m|V}{E_0 - E_m}\,P . \tag{25.9}$$

25.2 Time-Dependent Perturbation Theory 


We now consider time-dependent perturbations to the Hamiltonian. We begin with the Schrödinger equation,

$$i\hbar\frac{\partial}{\partial t}|\psi(t)\rangle = H|\psi(t)\rangle , \tag{25.10}$$

with

$$H = H_0 + H_1(t) , \tag{25.11}$$

where $H_0$ has a known spectrum and $H_1$ is some weak, time-dependent perturbation.

We may want to compute the probability that the initial $H_0$ eigenstate $|i\rangle$ transitions to the final $H_0$ eigenstate $|f\rangle$ in time $t$. If $H_1 = 0$, then the solution to the Schrödinger equation is of the form

$$|\psi(t)\rangle = \sum_n c_n e^{-iE_{n,0}t/\hbar}|n_0\rangle , \tag{25.12}$$

where the $c_n$ are time-independent. If $H_1(t) \neq 0$, then there is additional time dependence coming from the perturbation; we can still expand any state in the basis of unperturbed energy eigenstates, so we can write the solutions to the Schrödinger equation in the form

$$|\psi(t)\rangle = \sum_n c_n(t) e^{-iE_{n,0}t/\hbar}|n_0\rangle , \tag{25.13}$$

where the coefficients $c_n(t)$ are now functions of time. We could absorb the time dependence in the exponential into the coefficients $c_n(t)$, but it is notationally convenient not to do so.

Applying the Schrödinger equation to Eq. (25.13),

$$(i\hbar\partial_t - H_0 - H_1)|\psi(t)\rangle = 0 , \tag{25.14}$$

we are left with

$$\sum_n\big[i\hbar\,\partial_tc_n(t) - H_1(t)c_n(t)\big]e^{-iE_{n,0}t/\hbar}|n_0\rangle = 0 . \tag{25.15}$$

Taking the inner product with

$$\langle f_0|e^{iE_{f,0}t/\hbar} , \tag{25.16}$$

we arrive at

$$i\hbar\frac{dc_f}{dt} = \sum_n\langle f_0|H_1(t)|n_0\rangle e^{i\omega_{fn}t}c_n(t) , \tag{25.17}$$

where

$$\omega_{fn} = \frac{E_{f,0} - E_{n,0}}{\hbar} . \tag{25.18}$$

We can recognize this as a concrete example of working in the interaction picture; we have removed the time-dependence due to the unperturbed Hamiltonian, so the only time-dependence we see remaining comes from $H_1$.

What we have done so far is exact. We now assume that $|\psi(t = t_0)\rangle = |i\rangle$ and solve perturbatively in $H_1$. In the initial state,

$$c_n(t = t_0) = \delta_{ni} . \tag{25.19}$$

At first order, we then have

$$\frac{dc_f}{dt} = -\frac{i}{\hbar}\langle f_0|H_1(t)|i_0\rangle e^{i\omega_{fi}t} , \tag{25.20}$$

which can be integrated to yield

$$c_{fi}(t) = \delta_{fi} - \frac{i}{\hbar}\int_{t_0}^{t}dt'\,\langle f_0|H_1(t')|i_0\rangle e^{i\omega_{fi}t'} . \tag{25.21}$$

This is the transition amplitude between the states $|i_0\rangle$ and $|f_0\rangle$ to first order in the perturbation $H_1$.


25.2.1 SHO in a Time-Dependent Electric Field 


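Consider a simple harmonic oscillator in a time-dependent electric field

$$\varepsilon(t) = \varepsilon_0 e^{-t^2/\tau^2} . \tag{25.22}$$

The Hamiltonian is

$$H = \frac{p^2}{2m} + \frac{1}{2}m\omega^2x^2 - ex\,\varepsilon(t) . \tag{25.23}$$

If our initial state is the ground state, $|i\rangle = |0\rangle$, then we want to compute the transition probability to the state $|n\rangle$ after a time $t$. To first order, we find this probability to be

$$P_{0\to n}(t) = \frac{1}{\hbar^2}\left|\int_{t_0}^{t}dt'\,\langle n|H_1(t')|0\rangle e^{in\omega t'}\right|^2 , \quad n \neq 0 . \tag{25.24}$$

Here, we have used $E_{n,0} - E_{0,0} = n\hbar\omega$ for the SHO. Letting the pulse act over all times ($t_0 \to -\infty$, $t \to \infty$), this becomes

$$P_{0\to n}(t \to \infty) = \frac{1}{\hbar^2}\left|\int_{-\infty}^{\infty}dt'\,\langle n|H_1(t')|0\rangle e^{in\omega t'}\right|^2 , \quad n \neq 0 . \tag{25.25}$$

We can write the perturbation in the form

$$H_1(t) = -e\,\varepsilon(t)\sqrt{\frac{\hbar}{2m\omega}}\,\big(a + a^\dagger\big) , \tag{25.26}$$

and so we find

$$P_{0\to n}(t \to \infty) = \frac{e^2}{2m\hbar\omega}\left|\int_{-\infty}^{\infty}dt'\,\varepsilon(t')e^{in\omega t'}\langle n|\big(a + a^\dagger\big)|0\rangle\right|^2 . \tag{25.27}$$

Because $a|0\rangle = 0$ and $a^\dagger|0\rangle = |1\rangle$, we see that to this order, $P_{0\to n} = 0$ for $n > 1$. For $n = 1$, we find

$$P_{0\to 1}(t \to \infty) = \frac{e^2}{2m\hbar\omega}\left|\int_{-\infty}^{\infty}dt'\,e^{i\omega t'}\varepsilon_0e^{-t'^2/\tau^2}\right|^2 = \frac{\pi e^2\varepsilon_0^2\tau^2}{2m\hbar\omega}\,e^{-\omega^2\tau^2/2} . \tag{25.28}$$

If we work to higher orders, we will find nonzero transitions to higher excitation states $|n\rangle$. Note that $P_{0\to 1}$ is small both for $\omega\tau \gg 1$ (slow pulse) and $\omega\tau \ll 1$ (fast pulse), and peaks when $\omega\tau \sim 1$.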

25.2.2 Second-Order Transition Amplitude 


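Iterating the solution for $c_n(t)$ to next order, we find

$$c_n^{(2)}(t) = \left(-\frac{i}{\hbar}\right)^2\sum_m\int_{t_0}^{t}dt'\int_{t_0}^{t'}dt''\,e^{i\omega_{nm}t'}H_{1,nm}(t')\,e^{i\omega_{mi}t''}H_{1,mi}(t'') , \tag{25.29}$$

where

$$H_{1,jk}(t) = \langle j|H_1(t)|k\rangle . \tag{25.30}$$

Incorporating the higher-order terms, the transition probability is

$$P_{i\to n} = \left|c_n^{(1)}(t) + c_n^{(2)}(t) + \cdots\right|^2 . \tag{25.31}$$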




Lecture 26 (Dec. 11, 2017) 


26.1 Harmonic Perturbations 
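We now study perturbations of the form

$$H_1(t) = Ve^{-i\omega t} + V^\dagger e^{i\omega t} . \tag{26.1}$$

Here, $V$ is some operator acting on the Hilbert space that depends on the degrees of freedom in the system. We assume that $V$ is “weak,” so that we can hope to treat it perturbatively.

If this perturbation acts for a long time, we expect that it will induce a transition from the initially prepared state $|i\rangle$ to some final state $|f\rangle$ where the system has absorbed energy $\hbar\omega$ from the perturbation, i.e.,

$$E_f - E_i = \hbar\omega . \tag{26.2}$$

In the last class, we computed the transition amplitude for an arbitrary time-dependent perturbation at first order:

$$c_{fi}(t) = -\frac{i}{\hbar}\int_0^tdt'\,\langle f|H_1(t')|i\rangle e^{i\omega_{fi}t'} . \tag{26.3}$$

For the case of harmonic perturbations, we find

$$\begin{aligned}
c_{fi}(t) &= -\frac{i}{\hbar}\int_0^tdt'\left[\langle f|V|i\rangle e^{i(\omega_{fi}-\omega)t'} + \langle f|V^\dagger|i\rangle e^{i(\omega_{fi}+\omega)t'}\right] \\
&= \frac{1}{\hbar}\left[\langle f|V|i\rangle\,\frac{1 - e^{i(\omega_{fi}-\omega)t}}{\omega_{fi}-\omega} + \langle f|V^\dagger|i\rangle\,\frac{1 - e^{i(\omega_{fi}+\omega)t}}{\omega_{fi}+\omega}\right] .
\end{aligned} \tag{26.4}$$

As $t \to \infty$, $c_{fi}(t)$ is appreciable if either $\omega_{fi} - \omega \approx 0$, i.e., $E_f \approx E_i + \hbar\omega$ (absorption), or if $\omega_{fi} + \omega \approx 0$, i.e., $E_f \approx E_i - \hbar\omega$ (emission).

We now specialize to the case where $\omega$ is tuned so that

$$E_f - E_i \approx \hbar\omega . \tag{26.5}$$

In this case, the second term in Eq. (26.4) is small compared to the first, so we can write

$$c_{fi}(t) \approx \frac{1}{\hbar}\langle f|V|i\rangle\left(\frac{1 - e^{i(\omega_{fi}-\omega)t}}{\omega_{fi}-\omega}\right) . \tag{26.6}$$

The transition probability is then

$$\begin{aligned}
P_{fi}(t) &= |c_{fi}(t)|^2 \\
&= \frac{1}{\hbar^2}|\langle f|V|i\rangle|^2\left|\frac{1 - e^{i(\omega_{fi}-\omega)t}}{\omega_{fi}-\omega}\right|^2 \\
&= \frac{4}{\hbar^2}|\langle f|V|i\rangle|^2\,\frac{\sin^2\!\big(\tfrac{(\omega_{fi}-\omega)t}{2}\big)}{(\omega_{fi}-\omega)^2} .
\end{aligned} \tag{26.7}$$

In this last line, we have simply used

$$\begin{aligned}
\left|1 - e^{i\theta}\right|^2 &= \left|e^{i\theta/2}\big(e^{-i\theta/2} - e^{i\theta/2}\big)\right|^2 \\
&= \left|2\sin\frac{\theta}{2}\right|^2 \\
&= 4\sin^2\frac{\theta}{2} .
\end{aligned} \tag{26.8}$$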



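Now, consider the function

$$f(x, t) = \frac{\sin^2(tx)}{tx^2} . \tag{26.9}$$

[Figure: $f(x, t)$ plotted as a function of $x$.]

At $x = 0$, we have $f(0, t) = t$. As $x \to \infty$, $f(x, t) \to 0$. The function oscillates with a period proportional to $1/t$. As $t \to \infty$, the function decays to zero very quickly away from $x = 0$, but $f(0, t) = t \to \infty$. Thus, this appears to be approaching something like a delta function. We can check this by noting that

$$\int_{-\infty}^{\infty}dx\,\frac{\sin^2(tx)}{tx^2} = \int_{-\infty}^{\infty}d(tx)\,\frac{\sin^2(tx)}{(tx)^2} = \int_{-\infty}^{\infty}du\,\frac{\sin^2u}{u^2} = \pi , \tag{26.10}$$

which we see is independent of $t$. We conclude that

$$\lim_{t\to\infty}\frac{\sin^2(tx)}{tx^2} = \pi\delta(x) . \tag{26.11}$$

Thus,

$$\lim_{t\to\infty}P_{fi}(t) = \frac{1}{\hbar^2}|\langle f|V|i\rangle|^2\,\pi\,\delta\!\left(\frac{\omega_{fi}-\omega}{2}\right)t = \frac{2\pi}{\hbar^2}|\langle f|V|i\rangle|^2\,\delta(\omega_{fi}-\omega)\,t . \tag{26.12}$$

This may seem odd, because the result we found is that as $t \to \infty$, the probability of transition diverges to infinity, because it is proportional to $t$. The interpretation here is that there is a constant transition rate,

$$R_{fi} = \frac{P_{fi}}{t} = \frac{2\pi}{\hbar^2}|\langle f|V|i\rangle|^2\,\delta(\omega_{fi}-\omega) . \tag{26.13}$$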



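In general, there will be several possible final states $|f\rangle$ to which $H_1$ can induce a transition. The total transition rate is then given by the sum over all final states $|f\rangle$ of this rate. We define the density of states $\rho(E)$ such that $\rho(E)\,dE$ is the number of states between $E$ and $E + dE$. Then the net transition rate out of the initial state is given by

$$\begin{aligned}
W_{i,\text{out}} &= \sum_fR_{fi} \\
&= \frac{2\pi}{\hbar^2}\int dE_f\,\rho(E_f)\,|\langle f|V|i\rangle|^2\,\delta(\omega_f - \omega_i - \omega) .
\end{aligned} \tag{26.14}$$

Changing variables in the delta function, using $\delta(\omega_{fi} - \omega) = \hbar\,\delta(E_f - E_i - \hbar\omega)$, we reach

$$W_{i,\text{out}} = \frac{2\pi}{\hbar}\int dE_f\,\rho(E_f)\,|\langle f|V|i\rangle|^2\,\delta(E_f - E_i - \hbar\omega) . \tag{26.15}$$

This famous result is known as Fermi’s Golden Rule.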


26.1.1 The Photoelectric Effect 


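Consider an electron in a hydrogen atom interacting with an external EM field. This system has Hamiltonian

$$H = \frac{\left(\mathbf{p} - \frac{e}{c}\mathbf{A}\right)^2}{2m} - \frac{e^2}{r} , \tag{26.16}$$

where $\mathbf{A}$ is the external vector potential. Take the vector potential to be of the form

$$\mathbf{A} = \mathbf{A}_0\cos(\omega t - \mathbf{k}\cdot\mathbf{r}) , \tag{26.17}$$

and assume that $\mathbf{A}_0$ is weak. We can then write

$$H = \underbrace{\frac{\mathbf{p}^2}{2m} - \frac{e^2}{r}}_{H_0} - \frac{e}{2mc}\left(\mathbf{p}\cdot\mathbf{A} + \mathbf{A}\cdot\mathbf{p}\right) + \frac{e^2\mathbf{A}^2}{2mc^2} , \tag{26.18}$$

where $H_0$ is the hydrogen atom Hamiltonian. The final term, proportional to $\mathbf{A}^2$, does not contribute to the problem we are considering at first order in perturbation theory, so we will simply drop it here.

It is convenient to work in Coulomb gauge, $\nabla\cdot\mathbf{A} = 0$. In this gauge,

$$(\mathbf{p}\cdot\mathbf{A} + \mathbf{A}\cdot\mathbf{p})\psi = \big(-i\hbar(\nabla\cdot\mathbf{A}) + 2\mathbf{A}\cdot\mathbf{p}\big)\psi = 2\mathbf{A}\cdot\mathbf{p}\,\psi . \tag{26.19}$$

Thus, we can use

$$H = H_0 + H_1 \tag{26.20}$$

with

$$\begin{aligned}
H_1 &= -\frac{e}{mc}\cos(\omega t - \mathbf{k}\cdot\mathbf{r})\,\mathbf{A}_0\cdot\mathbf{p} \\
&= -\frac{e}{2mc}\left(e^{i(\omega t - \mathbf{k}\cdot\mathbf{r})} + e^{-i(\omega t - \mathbf{k}\cdot\mathbf{r})}\right)\mathbf{A}_0\cdot\mathbf{p} .
\end{aligned} \tag{26.21}$$

The first term gives the transition rate proportional to $\delta(E_f - E_i + \hbar\omega)$, while the second term gives the transition rate proportional to $\delta(E_f - E_i - \hbar\omega)$.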



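If we take the atom to initially be in its ground state, then only the second term can contribute, so Fermi’s Golden Rule gives

$$R_{0\to f} = \frac{2\pi}{\hbar}\sum_{\mathbf{k}_f}\left|\langle \mathbf{k}_f|\frac{e}{2mc}\,\mathbf{A}_0\cdot\mathbf{p}\,e^{i\mathbf{k}\cdot\mathbf{r}}|0\rangle\right|^2\delta(E_f - E_0 - \hbar\omega) . \tag{26.22}$$

Here, $|0\rangle$ denotes the ground state of the hydrogen atom, with wavefunction

$$\psi_0(\mathbf{x}) = \langle \mathbf{x}|0\rangle = \frac{e^{-r/a_0}}{(\pi a_0^3)^{1/2}} , \tag{26.23}$$

where $a_0$ is the Bohr radius. The ket $|\mathbf{k}_f\rangle$ is a plane wave state with wave vector $\mathbf{k}_f$, which describes the electron that is kicked out of the atom by the perturbation.

We can compute the matrix element in the transition rate by putting the system in a box of side length $L$, which gives

$$\begin{aligned}
\langle \mathbf{k}_f|\frac{e}{2mc}\,\mathbf{A}_0\cdot\mathbf{p}|0\rangle
&= \frac{e}{2mc}\int d^3r\,\frac{e^{-i\mathbf{k}_f\cdot\mathbf{r}}}{L^{3/2}}\,\mathbf{A}_0\cdot(-i\hbar\nabla)\,\frac{e^{-r/a_0}}{(\pi a_0^3)^{1/2}} \\
&= \frac{e\hbar}{2mc}\,\frac{\mathbf{A}_0\cdot\mathbf{k}_f}{L^{3/2}(\pi a_0^3)^{1/2}}\int d^3r\,e^{-i\mathbf{k}_f\cdot\mathbf{r}}\,e^{-r/a_0} \\
&= \frac{e\hbar}{2mc}\,\frac{\mathbf{A}_0\cdot\mathbf{k}_f}{L^{3/2}(\pi a_0^3)^{1/2}}\,\frac{8\pi}{a_0\left(k_f^2 + 1/a_0^2\right)^2} .
\end{aligned} \tag{26.24}$$

Note that we have ignored the factor of $e^{i\mathbf{k}\cdot\mathbf{r}}$ in this matrix element; this was not an accident. For typical wavelengths involved in photo-ionization, $|\mathbf{k}| \ll |\mathbf{k}_f|$.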
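The Fourier transform of the exponential used in the last step can be verified numerically. After the angular integration, $\int d^3r\, e^{-i\mathbf{k}_f\cdot\mathbf{r}}e^{-r/a_0} = (4\pi/k_f)\int_0^\infty dr\, r\sin(k_fr)\,e^{-r/a_0}$; the sketch below evaluates this radial integral with a Riemann sum, assuming units with $a_0 = 1$ and a few illustrative values of $k_f$.

```python
import numpy as np

def ft_numeric(k, a0=1.0, r_max_factor=60.0, n_points=600001):
    """(4 pi / k) * integral over r >= 0 of r sin(k r) exp(-r/a0), by Riemann sum."""
    r = np.linspace(0.0, r_max_factor * a0, n_points)
    dr = r[1] - r[0]
    integrand = r * np.sin(k * r) * np.exp(-r / a0)
    return 4 * np.pi / k * np.sum(integrand) * dr

def ft_closed_form(k, a0=1.0):
    """Closed form: 8 pi / (a0 (k^2 + 1/a0^2)^2)."""
    return 8 * np.pi / (a0 * (k**2 + 1 / a0**2) ** 2)

k_values = np.array([0.5, 1.0, 2.0, 5.0])
numeric = np.array([ft_numeric(k) for k in k_values])
closed = np.array([ft_closed_form(k) for k in k_values])
```

The rapid $k_f^{-4}$ fall-off of this transform at large $k_f$ is what suppresses the ionization rate for fast outgoing electrons.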
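For the sum over final states, we make the replacement

$$\sum_{\mathbf{k}_f} \to L^3\int\frac{d^3k_f}{(2\pi)^3} . \tag{26.25}$$

The logic is that, in a box of size $L$, the set of allowed momenta has spacing $2\pi/L$, and so each differential element in the integral is of the form

$$\frac{dk_x}{2\pi/L} , \tag{26.26}$$

which gives the form of the replacement above.

Putting everything together, we then have

$$\begin{aligned}
R_{0\to f} &= \frac{2\pi}{\hbar}\,L^3\int\frac{d^3k_f}{(2\pi)^3}\left(\frac{e\hbar}{2mc}\right)^2\frac{(\mathbf{A}_0\cdot\mathbf{k}_f)^2}{L^3\,\pi a_0^3}\,\frac{(8\pi)^2}{a_0^2\left(k_f^2 + 1/a_0^2\right)^4}\,\delta(E_f - E_0 - \hbar\omega) \\
&= \frac{e^2\hbar}{2m^2c^2a_0^5}\int\frac{k_f^2\,dk_f\,d\Omega}{(2\pi)^3}\,\frac{(\mathbf{A}_0\cdot\mathbf{k}_f)^2(8\pi)^2}{\left(k_f^2 + 1/a_0^2\right)^4}\,\delta\!\left(\frac{\hbar^2k_f^2}{2m} - E_0 - \hbar\omega\right) \\
&= \frac{e^2k_f}{2mc^2a_0^5\hbar}\int\frac{d\Omega}{(2\pi)^3}\,\frac{(\mathbf{A}_0\cdot\mathbf{k}_f)^2(8\pi)^2}{\left(k_f^2 + 1/a_0^2\right)^4} ,
\end{aligned} \tag{26.27}$$

with $k_f$ determined by energy conservation. The only angular dependence is in $\mathbf{A}_0\cdot\mathbf{k}_f$, so using

$$\int d\Omega\,\cos^2\theta = \frac{4\pi}{3} , \tag{26.28}$$

we reach the final result,

$$R_{0\to f} = \frac{16\,e^2k_f^3A_0^2}{3mc^2\hbar\,a_0^5\left(k_f^2 + 1/a_0^2\right)^4} . \tag{26.29}$$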
